IMMUNOGLOBULINS DEVOID OF LIGHT CHAINS



The invention relates to new isolated
immunoglobulins which are devoid of light polypeptide
chains. These immunoglobulins are not degradation
products of immunoglobulins composed of both heavy
and light polypeptide chains; on the contrary, the
invention defines a new member of the immunoglobulin
family, in particular a new type of molecule capable
of being involved in immune recognition. Such
immunoglobulins can be used for several purposes,
especially for diagnostic or therapeutic purposes,
including protection against pathological agents or
regulation of the expression or activity of proteins.

Up to now the structure proposed for
immunoglobulins has been a four-chain model,
referring to the presence of two identical light
polypeptide chains (light chains) and two identical
heavy polypeptide chains (heavy chains) linked
together by disulphide bonds to form a Y- or T-shaped
macromolecule. These chains are composed of a
constant region and a variable region, the constant
region being subdivided into several domains. The two
heavy polypeptide chains are usually linked by
disulphide bonds in a so-called "hinge region"
situated between the first and second domains of the
constant region.

Most of the proteins forming the class of
immunoglobulins are antibodies and accordingly
present one or several antigen binding sites.

According to the four-chain model, the antigen
binding site of an antibody is located in the variable
domains of each of the heavy and light chains, and
requires the association of the heavy and light chain
variable domains.

For the definition of these four-chain model
immunoglobulins, reference is made to Roitt, I. et al.
(Immunology, second edition, Gower Medical Publishing,
USA, 1989). Reference is especially made to the part
concerning the definition of the four-chain
immunoglobulins, their polypeptide and genetic
structures, the definition of their variable and
constant regions and the production of the fragments
obtained by enzymatic degradation according to
well-known techniques.

The inventors have surprisingly established that
different molecules can be isolated from animals which
naturally produce them, which molecules have
functional properties of immunoglobulins, these
functions being in some cases related to structural
elements distinct from those involved in the
function of four-chain immunoglobulins, owing for
instance to the absence of light chains.

The invention relates to two-chain model
immunoglobulins which correspond neither to fragments
obtained for instance by the degradation, in particular
the enzymatic degradation, of a natural four-chain
model immunoglobulin, nor to the expression
in host cells of DNA coding for the constant or the
variable region of a natural four-chain model
immunoglobulin or for a part of these regions, nor
to antibodies produced in lymphopathies, for
example in mice, rats or humans.

E.S. Ward et al (1) have described
experiments performed on variable domains of heavy
polypeptide chains (V H ) and/or light polypeptide
chains (V κ /F V ) to test the ability of these variable
domains to bind specific antigens. For this purpose,
a library of V H genes was prepared from the spleen
genomic DNA of mice previously immunized with these
specific antigens.

Ward et al have described in their publication
that V H domains are relatively sticky, presumably due
to the exposed hydrophobic surface normally capped by
the V κ or V λ domains. They consequently envisage that
it should be possible to design V H domains having
improved properties and further that V H domains with
binding activities could serve as the building blocks
for making variable fragments (Fv fragments) or
complete antibodies.

The invention does not start from the idea that
the different fragments (light and heavy chains) and
the different domains of these fragments of a four-chain
model immunoglobulin can be modified to define new or
improved antigen binding sites or a four-chain model
immunoglobulin.

The inventors have determined that
immunoglobulins can have a structure different from
the known four-chain model and that such different
immunoglobulins offer new means for the preparation of
diagnostic reagents, therapeutic agents or any other
reagent for research or industrial purposes.

Thus the invention provides new immunoglobulins 
which are capable of showing functional properties of 
four-chain model immunoglobulins although their 
structure appears to be more appropriate in many 
circumstances for their use, their preparation and in 
some cases for their modification. Moreover these
molecules can be considered as lead structures for the
modification of other immunoglobulins. The advantages
provided by these immunoglobulins include
the possibility of preparing them more easily.

The invention accordingly relates to 
immunoglobulins characterized in that they comprise 
two heavy polypeptide chains sufficient for the 
formation of a complete antigen binding site or
several antigen binding sites, these immunoglobulins 
being further devoid of light polypeptide chains. In a 
particular embodiment of the invention, these 
immunoglobulins are further characterized by the fact 
that they are the product of the expression in a 
prokaryotic or in a eukaryotic host cell, of a DNA or 
of a cDNA having the sequence of an immunoglobulin 
devoid of light chains as obtainable from lymphocytes 
or other cells of Camelids. 

The immunoglobulins of the invention can be 
obtained for example from the sequences which are 
described in figure 7. 

The immunoglobulins of the invention, which are
devoid of light chains, are such that the variable
domains of their heavy chains have properties
differing from those of the four-chain immunoglobulin
V H . The variable domain of a heavy-chain
immunoglobulin of the invention has no normal
interaction sites with the V L or with the C H 1 domain,
which do not exist in the heavy-chain immunoglobulins.
It is hence a novel fragment in many of its properties,
such as solubility and position of the binding site.
For clarity it is called V HH in this text,
to distinguish it from the classical V H of four-chain
immunoglobulins.

By "a complete antigen binding site" is meant,
according to the invention, a site which alone
allows the recognition and complete binding of an
antigen. This can be verified by any known method
for testing binding affinity.

These immunoglobulins, which can be prepared by
recombinant DNA techniques or isolated from
animals, will sometimes be called "heavy-chain
immunoglobulins" in the following pages. In a
preferred embodiment of the invention, these
immunoglobulins are in a pure form.

In a first embodiment, the immunoglobulins of the
invention are obtainable in prokaryotic cells,
especially in E. coli cells, by a process comprising the
steps of:

a) cloning in a Bluescript vector a DNA or cDNA
sequence coding for the V HH domain of an
immunoglobulin devoid of light chains, obtainable
for instance from lymphocytes of Camelids,

b) recovering the cloned fragment after
amplification using a 5' primer containing an Xho
site and a 3' primer containing the Spe site
and having the following sequence

TC TTA ACT AGT GAG GAG ACG GTG ACC TG,

c) cloning the recovered fragment in phase in the
immuno PBS vector after digestion of the vector
with Xho and Spe restriction enzymes,

d) transforming host cells, especially E. coli, by
transfection with the recombinant immuno PBS
vector of step c),

e) recovering the expression product of the V HH
coding sequence, for instance by using antibodies
raised against the dromedary V HH domain.
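
The 3' primer of step b) can be inspected computationally. The sketch below is only an illustration and not part of the described process: it searches the primer for the Spe I recognition sequence (ACTAGT) and translates the reverse complement of the primer. The function names are hypothetical and the standard genetic code is assumed.

```python
# Illustrative check of the 3' primer given in step b).
# Function names are hypothetical; the standard genetic code is assumed.

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

SPE_I_SITE = "ACTAGT"  # recognition sequence of Spe I


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from the first base; a trailing partial codon is ignored."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


primer_3 = "TC TTA ACT AGT GAG GAG ACG GTG ACC TG".replace(" ", "")
coding_strand = reverse_complement(primer_3)

print(SPE_I_SITE in primer_3)        # True: the primer carries a Spe I site
print(translate(coding_strand)[:6])  # QVTVSS
```

Read in this frame, the reverse complement of the primer begins with codons for Gln-Val-Thr-Val-Ser-Ser, which matches the end of several framework 4 sequences listed in this text (for example WGQGTQVTVSS), followed by the Spe I site.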

In another embodiment the immunoglobulins are
hetero-specific immunoglobulins obtainable by a
process comprising the steps of:

obtaining a first DNA or cDNA sequence coding for
a V HH domain or part thereof having a determined
specificity against a given antigen and comprised
between Xho and Spe sites,

obtaining a second DNA or cDNA sequence coding
for a V HH domain or part thereof, having a
determined specificity different from the
specificity of the first DNA or cDNA sequence and
comprised between the Spe and Eco RI sites,

digesting an immuno PBS vector with Eco RI and
Xho I restriction enzymes,

ligating the obtained DNA or cDNA sequences
coding for V HH domains, so that the DNA or cDNA
sequences are serially cloned in the vector,

transforming a host cell, especially an E. coli cell,
by transfection, and recovering the obtained
immunoglobulins.

In another embodiment, the immunoglobulins are 
obtainable by a process comprising the steps of: 

obtaining a DNA or cDNA sequence coding for a V HH 
domain or part thereof, having a determined 
specific antigen binding site, 

amplifying the obtained DNA or cDNA, using a 5'
primer containing an initiation codon and a
Hind III site, and a 3' primer containing a
termination codon and having an Xho I site,

recombining the amplified DNA or cDNA into the
Hind III (position 2650) and Xho I (position 4067)
sites of a plasmid pMM984,

transfecting permissive cells, especially NB-E
cells, with the recombinant plasmid,

recovering the obtained products.

Successful expression can be verified with 
antibodies directed against a region of a V HH domain, 
especially by an ELISA assay. 

According to another particular embodiment of
this process, the immunoglobulins are cloned in a
parvovirus.

In another example these immunoglobulins are
obtainable by a process comprising the further cloning
of a second DNA or cDNA sequence having another
determined antigen binding site, in the pMM984
plasmid.

Such an immunoglobulin can be further
characterized in that it is obtainable by a process
wherein the vector is YEp 52 and the transformed
recombinant cell is a yeast, especially S. cerevisiae.

A particular immunoglobulin is characterized in
that it has a catalytic activity, especially in that
it is directed against an antigen mimicking an
activated state of a given substrate. These catalytic
antibodies can be modified at the level of their
binding site, by random or directed mutagenesis, in
order to increase or modify their catalytic function.
Reference may be made to the publication of Lerner et
al (TIBS, November 1987, 427-430) for the general
technique for the preparation of such catalytic
immunoglobulins.

According to a preferred embodiment, the
immunoglobulins of the invention are characterized in
that their variable regions contain in position 45 an
amino-acid which is different from a leucine, proline or
glutamine residue.

Moreover the heavy-chain immunoglobulins are not
products characteristic of lymphocytes of animals or
of a human patient suffering from
lymphopathies. Such immunoglobulins produced in
lymphopathies are monoclonal in origin and result from
pathogenic mutations at the genomic level. They
apparently have no antigen binding site.

The two heavy polypeptide chains of these 
immunoglobulins can be linked by a hinge region 
according to the definition of Roitt et al. 

In a particular embodiment of the invention, 
immunoglobulins corresponding to the above-defined 
molecules are capable of acting as antibodies. 

The antigen binding site(s) of the 
immunoglobulins of the invention are located in the 
variable region of the heavy chain. 

In a particular group of these immunoglobulins 
each heavy polypeptide chain contains one antigen 
binding site on its variable region, and these sites 
correspond to the same amino-acid sequence. 

In a further embodiment of the invention the
immunoglobulins are characterized in that their heavy 
polypeptide chains contain a variable region (V HH ) and 
a constant region (C H ) according to the definition of 
Roitt et al, but are devoid of the first domain of 
their constant region. This first domain of the 
constant region is called C H 1. 

These immunoglobulins having no C H 1 domain are
such that the variable region of their chains is
directly linked to the hinge region at the C-terminal
part of the variable region.

The immunoglobulins of the type described
hereabove can comprise type G immunoglobulins and
especially immunoglobulins which are defined as
immunoglobulins of class 2 (IgG2) or immunoglobulins
of class 3 (IgG3).

The absence of the light chain and of the first
constant domain leads to a modification of the
nomenclature of the immunoglobulin fragments obtained
by enzymatic digestion, according to Roitt et al.

The terms Fc and pFc on the one hand, and Fc' and
pFc' on the other hand, corresponding respectively to
the papain and pepsin digestion fragments, are
maintained.

The terms Fab, F(ab) 2 , F(ab') 2 , Fabc, Fd and Fv are
no longer applicable in their original sense, as these
fragments have either a light chain, the variable part
of the light chain or the C H 1 domain.

The fragments obtained by papain digestion and
composed of the V HH domain and the hinge region will
be called FV HH h or F(V HH h) 2 depending upon whether or
not they remain linked by the disulphide bonds.

In another embodiment of the invention,
immunoglobulins corresponding to the definitions given
hereabove can originate from animals, especially
from animals of the camelid family. The inventors have
found that the heavy-chain immunoglobulins which
are present in camelids are not associated with a
pathological situation which would induce the
production of abnormal antibodies with respect to the
four-chain immunoglobulins. On the basis of a
comparative study of old world camelids (Camelus
bactrianus and Camelus dromedarius) and new world
camelids (for example Lama pacos, Lama glama and
Lama vicugna) the inventors have shown that the
immunoglobulins of the invention, which are devoid of
light polypeptide chains, are found in all species.
Nevertheless differences may be apparent in the molecular
weight of these immunoglobulins depending on the
animals. In particular, the molecular weight of a heavy
chain contained in these immunoglobulins can be from
approximately 43 kd to approximately 47 kd, in
particular 45 kd.

Advantageously the heavy-chain immunoglobulins of 
the invention are secreted in blood of camelids. 

Immunoglobulins according to this particular
embodiment of the invention are obtainable by
purification from serum of camelids, and a process for
the purification is described in detail in the
examples. In the case where the immunoglobulins are
obtained from Camelids, the invention relates to
immunoglobulins which are not in their natural
biological environment.

According to the invention immunoglobulin IgG2 as
obtainable by purification from the serum of camelids
can be characterized in that:

it is not adsorbed by chromatography on a Protein G
Sepharose column,

it is adsorbed by chromatography on a Protein A
Sepharose column,

it has a molecular weight of around 100 kd after
elution with a pH 4.5 buffer (0.15 M NaCl, 0.58%
acetic acid adjusted to pH 4.5 by NaOH),

it consists of heavy γ2 polypeptide chains of a
molecular weight of around 46 kd, preferably 45 kd,
after reduction.

According to a further embodiment of the
invention another group of immunoglobulins
corresponding to IgG3, as obtainable by purification
from the serum of Camelids, is characterized in that
the immunoglobulin:

is adsorbed by chromatography on a Protein A
Sepharose column,

has a molecular weight of around 100 kd after
elution with a pH 3.5 buffer (0.15 M NaCl, 0.58%
acetic acid),

is adsorbed by chromatography on a Protein G
Sepharose column and eluted with pH 3.5 buffer
(0.15 M NaCl, 0.58% acetic acid),

consists of heavy γ3 polypeptide chains of a
molecular weight of around 45 kd, in particular
between 43 and 47 kd, after reduction.

The immunoglobulins of the invention which are
devoid of light chains nevertheless comprise on their
heavy chains a constant region and a variable region.
The constant region comprises different domains.

The variable region of immunoglobulins of the
invention comprises frameworks (FW) and
complementarity determining regions (CDR), especially
4 frameworks and 3 complementarity determining regions.
It is distinguished from the four-chain immunoglobulins
especially by the fact that this variable region can
itself contain one or several antigen binding sites,
without contribution of the variable region of a light
chain, which is absent.

The amino-acid sequences of frameworks 1 and 4
comprise, among others, respectively amino-acid
sequences which can be selected from the following:

for the framework 1 domain

GGSVQTGGSLRLSCEISGLTFD
GGSVQTGGSLRLSCAVSGFSFS
GGSEQGGGSLRLSCAISGYTYG
GGSVQPGGSLTLSCTVSGATYS
GGSVQAGGSLRLSCTGSGFPYS
GGSVQAGGSLRLSCVAGFGTS
GGSVQAGGSLRLSCVSFSPSS

for the framework 4 domain

WGQGTQVTVSS
WGQGTLVTVSS
WGQGAQVTVSS
WGQGTQVTASS
RGQGTQVTVSL

for the CDR3 domain

ALQPGGYCGYGX----------CL
VSLMDRISQH------------GC
VPAHLGPGAILDLKKY------KY
FCYSTAGDGGSGE---------MY
ELSGGSCELPLLF---------DY
DWKYWTCGAQTGGYF-------GQ
RLTEMGACDARWATLATRTFAYNY
QKKDRTRWAEPREW--------NN
GSRFSSPVGSTSRLES-SDY--NY
ADPSIYYSILXIEY--------KY
DSPCYMPTMPAPPIRDSFGW--DD
TSSFYWYCTTAPY---------NV
TEIEWYGCNLRTTF--------TR
NQLAGGWYLDPNYWLSVGAY--AI
RLTEMGACDARWATLATRTFAYNY
DGWTRKEGGIGLPWSVQCEDGYNY
DSYPCHLL--------------DV
VEYPIADMCS------------RY

As stated above, the immunoglobulins of the
invention are preferably devoid of the totality of 
their C H 1 domain. 

Such immunoglobulins comprise C H 2 and C H 3 domains 
in the C-terminal region with respect to the hinge 
region. 

According to a particular embodiment of the
invention the constant region of the immunoglobulins
comprises C H 2 and C H 3 domains comprising an amino-acid
sequence selected from the following:
for the C H 2 domain:
APELLGGPTVFIFPPKPKDVLSITLTP 
APELPGGPSVFVFPTKPKDVLSISGRP 
APELPGGPSVFVFPPKPKDVLSISGRP 
APELLGGPSVFIFPPKPKDVLSISGRP 
for the C H 3 domain: 
GQTREPQVYTLA 
GQTREPQVYTLAPXRLEL 
GQPREPQVYTLPPSRDEL 
GQPREPQVYTLPPSREEM 
GQPREPQVYTLPPSQEEM 

Interestingly the inventors have shown that the
hinge region of the immunoglobulins of the invention
can be of variable length. When these
immunoglobulins act as antibodies, the length of the
hinge region contributes to determining
the distance separating the antigen binding sites.

Preferably an immunoglobulin according to the 
invention is characterized in that its hinge region 
comprises from 0 to 50 amino-acids. 

Particular sequences of the hinge region of the
immunoglobulins of the invention are the following:

GTNEVCKCPKCP

or,

EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP

The short hinge region corresponds to an IgG3
molecule and the long hinge sequence corresponds to an
IgG2 molecule.

Isolated V HH derived from heavy-chain
immunoglobulins or V HH libraries corresponding to the
heavy-chain immunoglobulins can be distinguished from
V H cloning of four-chain model immunoglobulins on the
basis of sequence features characterizing heavy-chain
immunoglobulins.

The camel heavy-chain immunoglobulin V HH region
shows a number of differences from the V H regions
derived from 4-chain immunoglobulins from all species
examined. At the level of the residues involved in
the V H /V L interactions, an important difference is
noted at position 45 (FW 2), which is
practically always leucine in the 4-chain
immunoglobulins (98%), the other amino acids at this
position being proline (1%) or glutamine (1%).

In the camel heavy-chain immunoglobulin, in the
sequences examined at present, leucine at position 45
is found only once. It could originate from a
four-chain immunoglobulin. In the other cases, it is
replaced by an arginine, cysteine or glutamic acid
residue. The presence of charged amino acids at this
position should contribute to making the V HH more
soluble.
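
As a minimal sketch of the position 45 criterion, the hypothetical functions below flag whether a residue differs from those reported for four-chain immunoglobulins (leucine, proline or glutamine) and whether it is a charged residue. It is assumed that the residue at position 45 has already been extracted under the numbering used in this text; all names are illustrative only.

```python
# Minimal sketch of the position 45 criterion described above.
# Names are hypothetical; the caller is assumed to supply the residue
# found at position 45 as a one-letter amino-acid code.

FOUR_CHAIN_RESIDUES_AT_45 = {"L", "P", "Q"}  # leucine, proline, glutamine
CHARGED_RESIDUES = {"R", "K", "D", "E"}      # Arg, Lys, Asp, Glu


def differs_from_four_chain_at_45(residue: str) -> bool:
    """True if the residue is not one of those reported for four-chain V H."""
    return residue.upper() not in FOUR_CHAIN_RESIDUES_AT_45


def is_charged(residue: str) -> bool:
    """True for arginine, lysine, aspartic acid or glutamic acid."""
    return residue.upper() in CHARGED_RESIDUES


for residue in ("L", "R", "C", "E"):
    print(residue, differs_from_four_chain_at_45(residue), is_charged(residue))
```

Of the three replacement residues mentioned, arginine and glutamic acid are charged, whereas cysteine is not; this check says nothing about the rest of the domain.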

The replacement by camelid-specific residues such
as those of position 45 appears to be of interest for
the construction of engineered V H regions derived
from the V H repertoire of 4-chain immunoglobulins.

A second feature specific to the camelid V HH
domain is the frequent presence of a cysteine in the
CDR 3 region associated with a cysteine in the CDR 1
region at position 31 or 33 or in the FW 2 region at
position 45. The
possibility of establishing a disulphide bond between
the CDR 3 region and the rest of the variable domain
would contribute to the stability and positioning of
the binding site.

With the exception of a single pathogenic myeloma
protein (DAW) such a disulphide bond has never been
encountered in immunoglobulin V regions derived from
4-chain immunoglobulins.

The heavy-chain immunoglobulins of the invention
have the further particular advantage of not being
sticky. Accordingly these immunoglobulins, being
present in the serum, aggregate much less than
isolated heavy chains of a four-chain immunoglobulin.
The immunoglobulins of the invention are soluble to a
concentration above 0.5 mg/ml, preferably above 1
mg/ml and more advantageously above 2 mg/ml.

These immunoglobulins further bear an extensive
antigen binding repertoire and undergo affinity and
specificity maturation in vivo. Accordingly they allow
the isolation and the preparation of antibodies having
a defined specificity for determined antigens.

Another interesting property of the
immunoglobulins of the invention is that they can be
modified and especially humanized. In particular it is
possible to replace all or part of the constant region
of these immunoglobulins by all or part of a constant
region of a human antibody. For example the C H 2 and/or
C H 3 domains of the immunoglobulin could be replaced by
the C H 2 and/or C H 3 domains of the IgG γ3 human
immunoglobulin.

In such humanized antibodies it is also possible 
to replace a part of the variable sequence, namely one 
or more of the framework residues which do not 
intervene in the binding site by human framework 
residues, or by a part of a human antibody. 

Conversely features (especially peptide
fragments) of heavy-chain immunoglobulin V HH regions
could be introduced into the V H or V L regions derived
from four-chain immunoglobulins with for instance the
aim of achieving greater solubility of the
immunoglobulins.

The invention further relates to a fragment of an 
immunoglobulin which has been described hereabove and 
especially to a fragment selected from the following 
group : 

a fragment corresponding to one heavy polypeptide 
chain of an immunoglobulin devoid of light 
chains, 

fragments obtained by enzymatic digestion of the 
immunoglobulins of the invention, especially 
those obtained by partial digestion with papain 
leading to the Fc fragment (constant fragment) 
and leading to FV HH h fragment (containing the 
antigen binding sites of the heavy chains) or its 
dimer F(V HH h) 2 , or a fragment obtained by further 
digestion with papain of the Fc fragment, leading 
to the pFc fragment corresponding to the C- 
terminal part of the Fc fragment, 

homologous fragments obtained with other 
proteolytic enzymes, 

a fragment of at least 10 preferably 20 amino 
acids of the variable region of the 
immunoglobulin, or the complete variable region, 
especially a fragment corresponding to the 
isolated V HH domains or to the V HH dimers linked 
to the hinge disulphide, 

a fragment corresponding to the hinge region of 
the immunoglobulin, or to at least 6 amino acids 
of this hinge region, 

a fragment of the hinge region comprising a 
repeated sequence of Pro-X, 

a fragment corresponding to at least 10
preferably 20 amino acids of the constant region
or to the complete constant region of the
immunoglobulin.

The invention also relates to a fragment
comprising a repeated sequence, Pro-X, which repeated
sequence contains at least 3 repeats of Pro-X, X being
any amino-acid and preferably Gln (glutamine), Lys
(lysine) or Glu (glutamic acid); a particular
repeated fragment is composed of a 12-fold repeat of
the sequence Pro-X.
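
The Pro-X repeat can be located in a hinge sequence by a simple scan. In the following sketch, the function name is hypothetical and X is taken to be any one-letter residue code; it counts the longest uninterrupted run of Pro-X units in the two hinge sequences given earlier in this text.

```python
# Sketch: longest uninterrupted run of Pro-X dipeptide units in a sequence.
# The function name is hypothetical; X is taken to be any residue.


def longest_pro_x_run(sequence: str) -> int:
    """Return the number of Pro-X units in the longest consecutive run."""
    seq = sequence.upper()
    best = 0
    for start in range(len(seq)):
        units = 0
        pos = start
        # Step two residues at a time while a proline opens a complete unit.
        while pos + 1 < len(seq) and seq[pos] == "P":
            units += 1
            pos += 2
        best = max(best, units)
    return best


long_hinge = "EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP"
short_hinge = "GTNEVCKCPKCP"

print(longest_pro_x_run(long_hinge))   # 12
print(longest_pro_x_run(short_hinge))  # 1
```

Applied to the long hinge sequence, the scan finds a 12-fold Pro-X repeat in which X is Gln, Lys or Glu, consistent with the particular fragment mentioned above.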

Such a fragment can be advantageously used as a 
link between different types of molecules. 

The amino-acids of the Pro-X sequence are chosen 
among any natural or non natural amino-acids. 

The fragments can be obtained by enzymatic
degradation of the immunoglobulins. They can also be
obtained by expression, in cells or organisms, of a
nucleotide sequence coding for the immunoglobulins, or
they can be chemically synthesized.

The invention also relates to anti-idiotype
antibodies belonging to the heavy-chain immunoglobulin
classes. Such anti-idiotypes can be produced against
human or animal idiotypes. A property of these
anti-idiotypes is that they can be used as idiotypic
vaccines, in particular for vaccination against
glycoproteins or glycolipids where the
carbohydrate determines the epitope.

The invention also relates to anti-idiotypes
capable of recognizing idiotypes of heavy-chain
immunoglobulins.

Such anti-idiotype antibodies can be either
syngeneic antibodies or allogeneic or xenogeneic
antibodies.

The invention also concerns nucleotide sequences
coding for all or part of a protein whose amino-acid
sequence comprises a peptide sequence selected from
the following:

GGSVQTGGSLRLSCEISGLTFD
GGSVQTGGSLRLSCAVSGFSFS
GGSEQGGGSLRLSCAISGYTYG
GGSVQPGGSLTLSCTVSGATYS
GGSVQAGGSLRLSCTGSGFPYS
GGSVQAGGSLRLSCVAGFGTS
GGSVQAGGSLRLSCVSFSPSS

WGQGTQVTVSS
WGQGTLVTVSS
WGQGAQVTVSS
WGQGTQVTASS
RGQGTQVTVSL

ALQPGGYCGYGX----------CL
VSLMDRISQH------------GC
VPAHLGPGAILDLKKY------KY
FCYSTAGDGGSGE---------MY
ELSGGSCELPLLF---------DY
DWKYWTCGAQTGGYF-------GQ
RLTEMGACDARWATLATRTFAYNY
QKKDRTRWAEPREW--------NN
GSRFSSPVGSTSRLES-SDY--NY
ADPSIYYSILXIEY--------KY
DSPCYMPTMPAPPIRDSFGW--DD
TSSFYWYCTTAPY---------NV
TEIEWYGCNLRTTF--------TR
NQLAGGWYLDPNYWLSVGAY--AI
RLTEMGACDARWATLATRTFAYNY
DGWTRKEGGIGLPWSVQCEDGYNY
DSYPCHLL--------------DV
VEYPIADMCS------------RY

APELLGGPSVFVFPPKPKDVLSISGXPK
APELPGGPSVFVFPTKPKDVLSISGRPK
APELPGGPSVFVFPPKPKDVLSISGRPK
APELLGGPSVFIFPPKPKDVLSISGRPK

GQTREPQVYTLAPXRLEL
GQPREPQVYTLPPSRDEL
GQPREPQVYTLPPSREEM
GQPREPQVYTLPPSQEEM

VTVSSGTNEVCKCPKCPAPELPGGPSVFVFP 

or, 

VTVSSEPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCPAPELLGGPSVFIFP 
GTNEVCKCPKCP 
APELPGGPSVFVFP 

EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP 
APELLGGPSVFIFP 

Such nucleotide sequences can be deduced from the
amino-acid sequences taking into account the
degeneracy of the genetic code. They can be
synthesized or isolated from cells producing
immunoglobulins of the invention.
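
Because of this degeneracy, a given peptide corresponds to many possible coding sequences. As an illustrative sketch (the function name is hypothetical and the standard genetic code is assumed), the number of distinct coding sequences for a peptide is the product of the number of synonymous codons of each of its residues:

```python
from collections import Counter
from math import prod

# Sketch: number of nucleotide sequences encoding a peptide under the
# standard genetic code. The function name is hypothetical.

# One letter per codon of the standard code (64 entries, '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Number of codons per amino acid (for example 6 for Ser, 1 for Trp).
CODONS_PER_RESIDUE = Counter(AMINO_ACIDS)


def count_coding_sequences(peptide: str) -> int:
    """Product of codon multiplicities over the residues of the peptide.

    An unknown letter (such as X) has no codon and yields 0.
    """
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())


print(count_coding_sequences("WGQGTQVTVSS"))  # 589824
```

Even an 11-residue framework 4 peptide such as WGQGTQVTVSS is thus compatible with several hundred thousand nucleotide sequences, which is why primers or probes deduced from amino-acid sequences are usually designed against the least degenerate stretches.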

A procedure for obtaining such DNA
sequences is described in the examples.

The invention also contemplates RNA, especially 
mRNA sequences corresponding to these DNA sequences, 
and also corresponding cDNA sequences. 

The nucleotide sequences of the invention can 
further be used for the preparation of primers 
appropriate for the detection in cells or screening of 
DNA or cDNA libraries to isolate nucleotide sequences 
coding for immunoglobulins of the invention. 

Such nucleotide sequences can be used for the 
preparation of recombinant vectors and the expression 
of these sequences contained in the vectors by host 
cells especially prokaryotic cells like bacteria or 
also eukaryotic cells and for example CHO cells, 
insect cells, simian cells like Vero cells, or any 
other mammalian cells. In particular, the fact that the
immunoglobulins of the invention are devoid of light
chains makes it possible to secrete them in eukaryotic
cells, since there is no need to have recourse to the step
consisting in the formation of the BIP protein which
is required in the four-chain immunoglobulins.

The inadequacies of the known methods for
producing monoclonal antibodies or immunoglobulins by
recombinant DNA technology come from the necessity, in
the vast majority of cases, to clone simultaneously the
V H and V L domains corresponding to the specific
binding site of 4-chain immunoglobulins. The animals
and especially camelids which produce heavy-chain 
immunoglobulins according to the invention, and 
possibly other vertebrate species are capable of 
producing heavy-chain immunoglobulins of which the 
binding site is located exclusively in the V HH domain. 
Unlike the few heavy-chain immunoglobulins produced in
other species by chain separation or by direct
cloning, the camelid heavy-chain immunoglobulins have
undergone extensive maturation in vivo. Moreover their
V region has naturally evolved to function in the absence
of the V L . They are therefore ideal for producing
monoclonal antibodies by recombinant DNA technology.
As obtaining specific antigen binding clones
does not depend on a stochastic process necessitating
a very large number of recombinant cells, this also
allows a much more extensive examination of the
repertoire.

This can be done at the level of the non
rearranged V HH repertoire using DNA derived from an
arbitrarily chosen tissue or cell type, or at the level
of the rearranged V HH repertoire, using DNA obtained
from B lymphocytes. More interesting, however, is to
reverse transcribe the mRNA from antibody producing
cells and to clone the cDNA, with or without prior
amplification, into an adequate vector. This will
result in antibodies which have already undergone
affinity maturation.

The examination of a large repertoire should 
prove to be particularly useful in the search for 
antibodies with catalytic activities. 

The invention thus provides libraries which can
be generated in a way which includes part of the hinge
sequence; identification is simple, as the hinge is
directly attached to the V HH domain.

These libraries can be obtained by cloning cDNA 
from lymphoid cells with or without prior PCR 
amplification. The PCR primers are located in the 
promoter, leader or framework sequences of the V HH for 
the 5' primer and in the hinge, C H 2, C H 3, 3'
untranslated region or polyA tail for the 3' primer. A
size selection of amplified material allows the
construction of a library limited to heavy-chain
immunoglobulins.

In a particular example, the following 3' primer,
in which a Kpn I site has been constructed and which
corresponds to amino-acids 313 to 319 (CGC CAT CAA GGT
AAC AGT TGA), is used in conjunction with mouse V H
primers described by Sastry et al and containing an
Xho I site (CTC GAG):

AG GTC CAG CTG CTC GAG TCT GG
AG CTC CAG CTG CTC GAG TCT GG
AG GTC CAG CTT CTC GAG TCT GG

These primers yield a library of camelid heavy-chain
immunoglobulins comprising the V HH region
(related to mouse or human subgroup III), the hinge
and a section of C H 2.

In another example, the cDNA is polyadenylated at 
its 5' end and the mouse-specific V HH primers are 
replaced by a poly-T primer with an inbuilt XhoI site, 
at the level of nucleotide 12: 

CTCGAG(T)12 

The same 3' primer with a KpnI site is used. 

This method generates a library containing all 
subgroups of immunoglobulins. 

Part of the interest in cloning a region 
encompassing the hinge-CH 2 link is that in both γ2 and 
γ3, a Sac site is present immediately after the hinge. 
This site allows the grafting of the sequence coding 
for the V HH and the hinge onto the Fc region of other 
immunoglobulins, in particular the human IgG1 and IgG3, 
which have the same amino acid sequence at this site 
(Glu 246 Leu 247). 

As an example, the invention contemplates a cDNA 
library composed of nucleotide sequences coding for a 
heavy-chain immunoglobulin , such as obtained by 
performing the following steps: 

a) treating a sample containing lymphoid cells, 
especially peripheral lymphocytes, spleen cells, lymph 
nodes or another lymphoid tissue from a healthy animal, 
especially selected among the Camelids, in order to 
separate the lymphoid cells, 

b) separating polyadenylated RNA from the other 
nucleic acids and components of the cells, 

c) reacting the obtained RNA with a reverse 
transcriptase in order to obtain the corresponding 
cDNA, 

d) contacting the cDNA of step c) with 5' primers 
corresponding to mouse V H domain of four-chain 
immunoglobulins, which primers contain a determined 
restriction site, for example an XhoI site, and with 3' 
primers corresponding to the N-terminal part of a C H 2 
domain containing a KpnI site, 

e) amplifying the DNA, 

f) cloning the amplified sequence in a vector, 
especially in a Bluescript vector, 

g) recovering the clones hybridizing with a probe 
corresponding to the sequence coding for a constant 
domain from an isolated heavy-chain immunoglobulin. 

This cloning gives rise to clones containing DNA 
sequences including the sequence coding for the hinge. 
It thus permits the characterization of the subclass 
of the immunoglobulin and the Sac I site useful for 
grafting the FV HH h to the Fc region. 

The recovery of the sequences coding for the 
heavy-chain immunoglobulins can also be achieved by 
the selection of clones containing DNA sequences 
having a size compatible with the lack of the C H 1 
domain. 

It is possible according to another embodiment of 
the invention, to add the following steps between 
steps c) and d) of the above process: 

- in the presence of a DNA polymerase and of 
deoxyribonucleotide triphosphates, contacting said 
cDNA with degenerate oligonucleotide primers, the 
sequences of which are capable of coding for the hinge 
region and N-terminal V HH domain of an immunoglobulin, 
the primers being capable of hybridizing with the cDNA 
and capable of initiating the extension of a DNA 
sequence complementary to the cDNA used as template, 

- recovering the amplified DNA. 

The clones can be expressed in several types of 
expression vectors. As an example, using the 
commercially available vector Immuno pBS (Huse et al: 
Science (1989) 246, 1275), clones produced in 
Bluescript® according to the above described 
procedure are recovered by PCR using the same 5' 
primer containing an XhoI site and a new 3' primer, 
corresponding to residues 113-103 in the framework of 
the immunoglobulins, in which an Spe site has been 
constructed: TC TTA ACT AGT GAG GAG ACG GTG ACC TG. 
This procedure allows the cloning of the V HH in the 
Xho/Spe site of the Immuno pBS vector. However, the 
3' end of the gene is not in phase with the 
identification "tag" and the stop codon of the vector. 
To achieve this, the construct is cut with Spe and the 
4-base overhangs are filled in using the Klenow 
fragment, after which the vector is religated. A 
further refinement consists in replacing the marker 
("tag") with a polyhistidine so that metal affinity 
purification of the cloned V HH can be performed. To 
achieve this, a Spe/EcoRI double-stranded 
oligonucleotide coding for 6 histidines and a 
termination codon is first constructed by synthesis of 
both strands followed by heating and annealing: 

5' CTAGT G CAC CAC CAT CAC CAT CAC TAA* TAG*      3' 
3'     A C GTG GTG GTA GTG GTA GTG ATT  ATC TTAA 5' 
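
The design of this linker can be checked computationally. The sketch below assumes that the upper strand is written 5' to 3' and the lower strand 3' to 5' (an interpretation of the printed layout, not something the text states), and that the reading frame starts at the first CAC codon. The variable names and the partial codon table are illustrative only.

```python
# Strands as printed in the text. Assumption: the upper strand reads 5'->3'
# and the lower strand is printed 3'->5' underneath it.
TOP = "CTAGTGCACCACCATCACCATCACTAATAG"
BOTTOM_3_TO_5 = "ACGTGGTGGTAGTGGTAGTGATTATCTTAA"

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
# Only the codons that occur in this linker; not a full genetic code table.
CODONS = {"CAC": "His", "CAT": "His", "TAA": "Stop", "TAG": "Stop"}


def complement(seq: str) -> str:
    """Base-by-base complement, without reversing the sequence."""
    return "".join(COMPLEMENT[base] for base in seq)


OVERHANG = 4  # length of the single-stranded end at each side

top_overhang = TOP[:OVERHANG]                        # unpaired 5' end, top
paired_top = TOP[OVERHANG:]                          # double-stranded part
paired_bottom = BOTTOM_3_TO_5[:len(paired_top)]
# Unpaired end of the lower strand, reversed so that it reads 5'->3'.
bottom_overhang = BOTTOM_3_TO_5[len(paired_top):][::-1]

duplex_ok = complement(paired_top) == paired_bottom

# Assumed reading frame: codons start right after the leading CTAGTG.
coding = TOP[6:]
translation = [CODONS[coding[i:i + 3]] for i in range(0, len(coding), 3)]

print("duplex complementary:", duplex_ok)
print("overhangs:", top_overhang, bottom_overhang)
print("translation:", "-".join(translation))
```

Under these assumptions the two strands pair over 26 bases and leave a CTAG end (compatible with a SpeI cut) and an AATT end (compatible with an EcoRI cut), with six histidine codons followed by two termination codons.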

The vector containing the insert is then digested 
with SpeI and EcoRI to remove the resident "tag" 
sequence, which can be replaced by the 
poly-His/termination sequence. The produced V HH can 
equally be detected by using antibodies raised against 
the dromedary V HH regions. Under laboratory 
conditions, V HH regions are produced in the Immuno pBS 
vector in mg amounts per liter. 

The invention also relates to a DNA library 
composed of nucleotide sequences coding for a heavy- 
chain immunoglobulin, such as obtained from cells with 
rearranged immunoglobulin genes. 

In a preferred embodiment of the invention, the 
library is prepared from cells from an animal 
previously immunized against a determined antigen. 
This allows the selection of antibodies having a 
preselected specificity for the antigen used for 
immunization. 

In another embodiment of the invention, the 
amplification of the cDNA is not performed prior to 
the cloning of the cDNA. 

The heavy chain of the four-chain immunoglobulins 
remains sequestered in the cell by a chaperone protein 
(BiP) until it has combined with a light chain. The 
binding site for the chaperone protein is the C H 1 
domain. As this domain is absent from the heavy-chain 
immunoglobulins, their secretion is independent of the 
presence of the BiP protein or of the light chain. 
Moreover, the inventors have shown that the obtained 
immunoglobulins are not sticky and accordingly will 
not abnormally aggregate. 

The invention also relates to a process for the 
preparation of a monoclonal antibody directed against 
a determined antigen, the antigen binding site of the 
antibody consisting of heavy polypeptide chains and 
which antibody is further devoid of light polypeptide 
chains, which process comprises: 

immortalizing lymphocytes, obtained for example 
from the peripheral blood of Camelids previously 
immunized with a determined antigen, with an 
immortal cell and preferably with myeloma cells, 
in order to form a hybridoma, 

culturing the immortalized cells (hybridoma) 
formed and recovering the cells producing the 
antibodies having the desired specificity. 

The preparation of antibodies can also be 
performed without a previous immunization of Camelids. 

According to another process for the preparation 
of antibodies, recourse to the hybridoma technique 
is not required. 

According to such a process, antibodies are 
prepared in vitro and can be obtained by a 
process comprising the steps of: 

cloning into vectors, especially into phages and 
more particularly filamentous bacteriophages, DNA 
or cDNA sequences obtained from lymphocytes 
especially PBLs of Camelids previously immunized 
with determined antigens, 

transforming prokaryotic cells with the above 
vectors in conditions allowing the production of 
the antibodies, 

selecting the antibodies for their heavy-chain 
structure and further by subjecting them to 
antigen-affinity selection, 

recovering the antibodies having the desired 
specificity. 

In another embodiment of the invention the 
cloning is performed in vectors, especially in 
plasmids coding for bacterial membrane proteins. 
Prokaryotic cells are then transformed with the above 
vectors in conditions allowing the expression of 
antibodies in their membrane. 

The positive cells are further selected by 
antigen affinity selection. 

The heavy chain antibodies which do not contain 
the C H 1 domain present a distinct advantage in this 
respect. Indeed, the C H 1 domain binds to BiP-type 
chaperone proteins present within eukaryotic cells, 
and the heavy chains are not transported out of the 
endoplasmic reticulum unless light chains are 
present. This means that in eukaryotic cells, 
efficient cloning of 4-chain immunoglobulins in 
non-mammalian cells such as yeast cells can depend on 
the properties of the resident BiP-type chaperone and 
can hence be very difficult to achieve. In this respect 
the heavy chain antibodies of the invention, which lack 
the C H 1 domain, present a distinctive advantage. 

In a preferred embodiment of the invention the 
cloning can be performed in yeast, either for the 
production of antibodies or for the modification of 
the metabolism of the yeast. As an example, the YEp52 
vector can be used. This vector has the 2µ origin of 
replication (ORI) of the yeast together with a 
selection marker LEU2. 

The cloned gene is under the control of the gal1 
promoter and accordingly is inducible by galactose. 
Moreover, the expression can be repressed by glucose, 
which allows a very high concentration 
of cells to be obtained before the induction. 

The cloning between BamHI and SalI sites, using 
the same strategy of production of genes by PCR as the 
one described above, allows the cloning of camelid 
immunoglobulin genes in E. coli. As an example of 
metabolic modulation which can be obtained by 
antibodies and proposed for the yeast, one can cite 
the cloning of antibodies directed against cyclins, 
that is, proteins involved in the regulation of the 
cell cycle of the yeast (TIBS 16, 430, J.D. 
McKinney, N. Heintz, 1991). Another example is the 
introduction by genetic engineering of an antibody 
directed against CD 28, which antibody would be 
inducible (for instance by gal1), within the genome of 
the yeast. The CD 28 is involved at the level of the 
initiation of cell division, and therefore the 
expression of antibodies against this molecule would 
allow an efficient control of multiplication of the 
cells and the optimization of methods for the 
production in bioreactors or by means of immobilized 
cells. 

In yet another embodiment of the invention, the 
cloning vector is a plasmid or a eukaryotic virus 
vector and the cells to be transformed are eukaryotic 
cells, especially yeast cells, mammalian cells for 
example CHO cells or simian cells such as Vero cells, 
insect cells, plant cells, or protozoan cells. 

For more details concerning the procedure to be 
applied in such a case, reference is made to the 
publication of Marks et al, J. Mol. Biol. 1991, 
222:581-597. 

Furthermore, starting from the immunoglobulins of 
the invention, or from fragments thereof, new 
immunoglobulins or derivatives can be prepared. 

Accordingly, immunoglobulins corresponding to the above 
given definitions can be prepared against determined 
antigens. Especially the invention provides monoclonal 
or polyclonal antibodies devoid of light polypeptide 
chains, or antisera containing such antibodies, 
directed against determined antigens and for example 
against antigens of pathological agents such as 
bacteria, viruses or parasites. As examples of antigens 
or antigenic determinants against which antibodies 
could be prepared, one can cite the envelope 
glycoproteins of viruses or peptides thereof, such as 
the external envelope glycoprotein of an HIV virus or 
the surface antigen of the hepatitis B virus. 

Immunoglobulins of the invention can also be 
directed against a protein, hapten, carbohydrate or 
nucleic acid. 

Particular antibodies according to the invention 
are directed against the galactosyl α-1-3-galactose 
epitope. 

The immunoglobulins of the invention allow 
further the preparation of combined products such as 
the combination of the heavy-chain immunoglobulin or a 
fragment thereof with a toxin, an enzyme, a drug, a 
hormone. 

As an example, one can prepare the combination of a 
heavy-chain immunoglobulin bearing an antigen binding 
site recognizing a myeloma immunoglobulin epitope with 
the abrin or mistletoe lectin toxin. Such a construct 
would have its uses in patient-specific therapy. 

Another advantageous combination that one can 
prepare is between a heavy-chain immunoglobulin 
recognizing an insect gut antigen and a toxin 
specific for insects, such as the toxins of the 
different serotypes of Bacillus thuringiensis or 
Bacillus sphaericus. Such a construct cloned into 
plants can be used to increase the specificity or the 
host range of existing bacterial toxins. 

The invention also proposes antibodies having 
a different specificity on each heavy polypeptide 
chain. These multifunctional, especially bifunctional 
antibodies could be prepared by combining two heavy 
chains of immunoglobulins of the invention or one 
heavy chain of an immunoglobulin of the invention with 
a fragment of a four-chain model immunoglobulin. 

The invention also provides hetero-specific 
antibodies which can be used for the targeting of 
drugs or any biological substance like hormones. In 
particular they can be used to selectively target 
hormones or cytokines to a limited category of cells. 
An example is a combination of a murine or human 
antibody raised against interleukin 2 (IL2) and a 
heavy-chain antibody raised against CD 4 cells. This 
could be used to reactivate CD 4 cells which have lost 
their IL2 receptor. 

The heavy-chain immunoglobulins of the invention 
can also be used for the preparation of 
hetero-specific antibodies. This can be achieved either 
according to the above described method, by reduction 
of the bridges between the different chains and 
reoxidation, according to the usual techniques, of two 
antibodies having different specificities, or 
by serial cloning of two antibodies, 
for instance in the Immuno pBS vector. 

In such a case, a first gene corresponding to the 
V HH domain comprised between an Xho site and a Spe site 
is prepared as described above. A second gene is then 
prepared in an analogous way by using as 5' 
extremity a primer containing a Spe site, and as 3' 
extremity a primer containing a termination codon and 
an EcoRI site. The vector is then digested with EcoRI 
and XhoI, and both V HH genes are digested 
respectively by Xho/Spe and by Spe/EcoRI. 

After ligation, both immunoglobulin genes are 
serially cloned. The spacing between both genes can 
be increased by the introduction of additional codons 
within the 5' SpeI primer. 
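
The directional logic of this serial cloning can be illustrated with a small model in which each fragment is described only by the restriction sites at its two ends. This is a simplified sketch: the `Fragment` class and the `assemble` function are hypothetical, and real ligation depends on the actual cohesive-end sequences, which are not modelled here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    """A DNA fragment described only by the enzymes that cut its ends."""
    name: str
    left: str   # enzyme that generated the 5' end
    right: str  # enzyme that generated the 3' end


def assemble(vector: Fragment, inserts: list) -> list:
    """Order inserts so that every junction joins ends cut by the same enzyme.

    `vector.left` and `vector.right` are the two open ends of the digested
    vector. Raises ValueError if no consistent order exists.
    """
    ordered = []
    remaining = list(inserts)
    site = vector.left
    while remaining:
        nxt = next((f for f in remaining if f.left == site), None)
        if nxt is None:
            raise ValueError(f"no remaining fragment starts with a {site} end")
        ordered.append(nxt)
        remaining.remove(nxt)
        site = nxt.right
    if site != vector.right:
        raise ValueError("last insert does not match the vector end")
    return ordered


vector = Fragment("Immuno pBS", left="XhoI", right="EcoRI")
first_gene = Fragment("first V HH", left="XhoI", right="SpeI")
second_gene = Fragment("second V HH", left="SpeI", right="EcoRI")

# The input order does not matter; the end compatibility fixes the layout.
order = [f.name for f in assemble(vector, [second_gene, first_gene])]
print(" - ".join(order))
```

Because each junction uses a different enzyme, the two genes can only ligate into the vector in one order and orientation, which is the point of choosing three distinct sites.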

In a particular embodiment of the invention, the 
hinge region of IgG2 immunoglobulins according to the 
invention is semi-rigid and is thus appropriate for 
coupling proteins. In such an application proteins or 
peptides can be linked to various substances, 
especially to ligands through the hinge region used as 
spacer. Advantageously the fragment comprises at 
least 6 amino acids. 

According to the invention it is interesting to 
use a sequence comprising a repeated sequence Pro-X, X 
being any amino acid and preferably Gln, Lys or Glu, 
especially a fragment composed of at least a 3-fold 
repeat and preferably of a 12-fold repeat, for 
coupling proteins to ligands, or for assembling 
different protein domains. 
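
A Pro-X repeat of this kind is easy to generate or to search for in a protein sequence. The sketch below uses one-letter amino acid codes and restricts X to the preferred residues Gln, Lys and Glu (the text allows any amino acid); the function names are hypothetical.

```python
import re

# One-letter codes: P = Pro, Q = Gln, K = Lys, E = Glu.
# Assumption: X is limited to the preferred residues named in the text.
PRO_X_REPEAT = re.compile(r"(?:P[QKE]){3,}")  # at least a 3-fold repeat


def make_spacer(x_residues: str, repeats: int) -> str:
    """Build a Pro-X spacer by cycling through the given X residues."""
    return "".join(
        "P" + x_residues[i % len(x_residues)] for i in range(repeats)
    )


def longest_pro_x_run(sequence: str) -> int:
    """Number of Pro-X units in the longest run of >= 3 units (0 if none)."""
    runs = [len(m.group(0)) // 2 for m in PRO_X_REPEAT.finditer(sequence)]
    return max(runs, default=0)


spacer = make_spacer("QKE", 12)  # the preferred 12-fold repeat
print(spacer, longest_pro_x_run(spacer))
```

A 12-fold repeat gives a 24-residue spacer, which comfortably exceeds the minimum fragment length of 6 amino acids mentioned for the hinge-derived spacer.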

The hinge region or a fragment thereof can also 
be used for coupling proteins to ligands or for 
assembling different protein domains. 

Usual techniques for the coupling are appropriate 
and especially reference may be made to the technique 
of protein engineering by assembling cloned sequences. 

The antibodies according to this invention could 
be used as reagents for the diagnosis in vitro or by 
imaging techniques. The immunoglobulins of the 
invention could be labelled with radio-isotopes, 
chemical or enzymatic markers or chemiluminescent 
markers. 

As an example, and especially in the case of 
detection or observation with the immunoglobulins by 
imaging techniques, a label like technetium, 
especially technetium 99, is advantageous. This label 
can be used for direct labelling by a coupling 
procedure with the immunoglobulins or fragments 
thereof, or for indirect labelling after a step of 
preparation of a complex with the technetium. 

Other interesting radioactive labels are for 
instance indium and especially indium 111, or iodine, 
especially I 131 , I 125 and I 123 . 

For the description of these techniques reference 
is made to the FR patent application published under 
number 2649488. 

In these applications the small size of the V HH 
fragment is a definite advantage for penetration 
into tissue. 

The invention also concerns monoclonal antibodies 
reacting with anti-idiotypes of the above-described 
antibodies. 

The invention also concerns cells or organisms in 
which heavy-chain immunoglobulins have been cloned. 
Such cells or organisms can be used for the purpose of 
producing heavy-chain immunoglobulins having a desired 
preselected specificity, or corresponding to a 
particular repertoire. They can also be produced for 
the purpose of modifying the metabolism of the cell 
which expresses them. In the case of modification of 
the metabolism of cells transformed with the sequences 
coding for heavy-chain immunoglobulins, these produced 
heavy-chain immunoglobulins are used like antisense 
DNA. Antisense DNA is usually involved in blocking the 
expression of certain genes such as for instance the 
variable surface antigen of trypanosomes or other 
pathogens. Likewise, the production or the activity of 
certain proteins or enzymes could be inhibited by 
expressing antibodies against this protein or enzyme 
within the same cell. 

The invention also relates to a modified 4-chain 
immunoglobulin or fragments thereof, the V H regions 
of which have been partially replaced by specific 
sequences or amino acids of heavy chain 
immunoglobulins, especially by sequences of the V HH 
domain. A particular modified V H domain of a four-chain 
immunoglobulin is characterized in that the 
leucine, proline or glutamine in position 45 of the V H 
regions has been replaced by other amino acids and 
preferably by arginine, glutamic acid or cysteine. 

A further modified V H or V L domain of a four-chain 
immunoglobulin is characterized by linking of 
CDR loops together or to FW regions by the 
introduction of paired cysteines, the CDR region being 
selected between the CDR 1 and the CDR 3, the FW region 
being the FW 2 region, and especially in which one of 
the cysteines introduced is in position 31 or 33 of FW 2 
or 45 of CDR 2 and the other in CDR 3. 

Especially the introduction of paired cysteines 
is such that the CDR 3 loop is linked to the FW 2 or 
CDR 1 domain, and more especially the cysteine of the 
CDR 3 of the V H is linked to a cysteine in position 31 
or 33 of FW 2 or in position 45 of CDR 2. 

In another embodiment of the invention, plant 
cells can be modified by the heavy-chain 
immunoglobulins according to the invention, in order 
that they acquire new properties or increased 
properties. 

The heavy-chain immunoglobulins of the invention 
can be used for gene therapy of cancer for instance by 
using antibodies directed against proteins present on 
the tumor cells. 

In such a case, the expression of one or two V HH 
genes can be obtained by using vectors derived from 
parvoviruses or adenoviruses. The parvoviruses are 
characterized by the fact that they are devoid of 
pathogenicity or almost not pathogenic for normal 
human cells and by the fact that they are capable of 
easily multiplying in cancer cells (Russel S.J. 1990, 
Immunol. Today 11, 196-200). 

The heavy-chain immunoglobulins are for instance 
cloned within HindIII/XbaI sites of the infectious 
plasmid of the murine MVM virus (pMM984) (Merchlinsky 
et al, 1983, J. Virol. 47, 227-232) and then placed 
under the control of the MVM38 promoter. 

The gene of the V HH domain is amplified by PCR by 
using a 5' primer containing an initiation codon and a 
HindIII site, the 3' primer containing a termination 
codon and an XbaI site. 

This construct is then inserted between positions 
2650 (HindIII) and 4067 (XbaI) within the plasmid. 

The efficiency of the cloning can be checked by 
transfection. The vector containing the antibody is 
then introduced in permissive cells (NB-E) by 
transfection. 

The cells are recovered after two days and the 
presence of V HH regions is determined with an ELISA 
assay by using rabbit antiserum reacting with the V HH 
part. 

The invention further allows the preparation of 
catalytic antibodies in different ways. The 
production of antibodies directed against components 
mimicking activated states of substrates (for example 
vanadate as a component mimicking the activated state 
of phosphate, in order to produce phosphoesterase 
activities, or phosphonate as a compound mimicking the 
peptide bond, in order to produce proteases) 
makes it possible to obtain antibodies having a 
catalytic function. Another way to obtain such 
antibodies consists in performing random mutagenesis 
in clones of antibodies, for example by PCR, by 
introducing abnormal bases during the amplification of 
clones. These amplified fragments obtained by PCR are 
then introduced into an appropriate vector for cloning. 
Their expression at the surface of the bacteria 
permits the detection by the substrate of clones 
having the enzymatic activity. These two approaches 
can of course be combined. Finally, on the basis of 
the data available on the structure, for example the 
data obtained by X-ray crystallography or NMR, the 
modifications can be directed. These modifications 
can be performed by usual techniques of genetic 
engineering or by complete synthesis. One advantage 
of the V HH of the heavy chain immunoglobulins of the 
invention is the fact that they are sufficiently 
soluble. 



WO 94/04678 PCT/EP93/02214 

The heavy chain immunoglobulins of the invention 
can further be produced in plant cells, especially in 
transgenic plants. As an example, the heavy chain 
immunoglobulins can be produced in plants using the 
pMon530 plasmid (Roger et al. Meth Enzym 153 1566 
1987), a constitutive plant expression vector, as has 
been described for classical four chain antibodies 
(Hiatt et al. Nature 342 76-78, 1989), once again using 
the appropriate PCR primers as described above to 
generate a DNA fragment in the right phase. 

Other advantages and characteristics of the 
invention will become apparent in the examples and 
figures which follow. 

FIGURES 

Figure 1 : Characterisation and purification of 
camel IgG by affinity chromatography on 
Protein A and Protein G Sepharose 
(Pharmacia) 

(A) shows, after reduction, the SDS-PAGE protein 
profile of the adsorbed and non adsorbed fractions of 
Camelus dromedarius serum. The fraction adsorbed on 
Protein A and eluted with NaCl 0.15 M, acetic acid 
0.58% shows upon reduction (lane c) three heavy chain 
components of respectively 50, 46 and 43 Kd and light 
chain (rabbit IgG in lane a). The fraction adsorbed 
on a Protein G Sepharose (Pharmacia) derivative which 
has been engineered to delete the albumin binding 
region (lane e) and eluted with 0.1 M glycine-HCl pH 2.7 
lacks the 46 Kd heavy chain, which is recovered in the 
non adsorbed fraction (lane f). None of these 
components is present in the fraction non adsorbed on 
Protein A (lane d); lane b contains the molecular 
weight markers. 

(B) and (C) By differential elution, immunoglobulin 
fractions containing the 50 and 43 Kd heavy chain can 
be separated. 5 ml of C. dromedarius serum is adsorbed 
onto a 5 ml Protein G Sepharose column and the column 
is extensively washed with 20 mM phosphate buffer, pH 
7.0. Upon elution with pH 3.5 buffer (0.15 M NaCl, 
0.58% acetic acid) a 100 Kd component is eluted which 
upon reduction yields a 43 Kd heavy chain (lane 1). 
After column eluant absorbance has fallen to 
background level, a second immunoglobulin component of 
170 Kd can be eluted with pH 2.7 buffer (0.1 M 
glycine-HCl). This fraction upon reduction yields a 
50 Kd heavy chain and a broad light chain band (lane 2). 

The fraction non adsorbed on Protein G is then brought 
onto a 5 ml Protein A Sepharose column. After washing 
and elution with pH 3.5 buffer (0.15 M NaCl, 0.58% 
acetic acid) a third immunoglobulin of 100 Kd is 
obtained which consists solely of 46 Kd heavy chains 
(lane 3). 

Figure 2 : Immunoglobulins of Camelus bactrianus, 
Lama vicugna, Lama glama and Lama pacos 
to Protein A (A lanes) and to Protein G 
(G lanes) analyzed on SDS-PAGE before 
(A) and after reduction (B) 

10 µl of serum obtained from the different species 
were added to Eppendorf tubes containing 10 mg of 
Protein A or Protein G Sepharose suspended in 400 µl 
of pH 8.3 immunoprecipitation buffer (NaCl 0.2 M, 
Tris 0.01 M, EDTA 0.01 M, Triton X100 1%, ovalbumin 
0.1%). The tubes were slowly rotated for 2 hours at 
4°C. After centrifugation the pellets were washed 3 
times in buffer and once in buffer in which the Triton 
and ovalbumin had been omitted. The pellets were then 
resuspended in the SDS-PAGE sample solution, 70 µl per 
pellet, with or without dithiothreitol as reductant. 
After boiling for 3 min at 100°C, the tubes were 
centrifuged and the supernatants analysed. 

In all species examined the unreduced fractions (A) 
contain, in addition to molecules of approximately 
170 Kd, also smaller major components of approximately 
100 Kd. In the reduced sample (B) the constituent 
heavy and light chains are detected. In all species a 
heavy chain component (marked by an asterisk *) is 
present in the material eluted from the Protein A but 
absent in the material eluted from the Protein G. 

Figure 3 : IgG1, IgG2 and IgG3 were prepared from 
serum obtained from healthy or 
Trypanosoma evansi infected Camelus 
dromedarius (CATT titer 1/160 (3)) and 
analysed by radioimmunoprecipitation 
or Western blotting for anti-trypanosome 
activity 
(A) 35 S methionine labelled Trypanosoma evansi 
antigen lysate (500,000 counts) was added to 
Eppendorf tubes containing 10 µl of serum or 20 µg of 
IgG1, IgG2 or IgG3 in 200 µl of pH 8.3 
immunoprecipitation buffer containing 0.1 M TLCK as 
proteinase inhibitor and slowly rotated at 4°C for 
one hour. The tubes were then supplemented with 10 mg 
of Protein A Sepharose suspended in 200 µl of the same 
pH 8.3 buffer and incubated at 4°C for an additional 
hour. 

After washing and centrifugation at 15000 rpm for 
12 s, each pellet was resuspended in 75 µl SDS-PAGE 
sample solution containing DTT and heated for 3 min 
at 100°C. After centrifugation in an Eppendorf 
minifuge at 15000 rpm for 30 s, 5 µl of the 
supernatant was saved for radioactivity determination 
and the remainder analysed by SDS-PAGE and 
fluorography. The counts per 5 µl sample are indicated 
for each lane. 

(B) 20 µg of IgG1, IgG2 and IgG3 from healthy and 
trypanosome infected animals were separated by 
SDS-PAGE without prior reduction or heating. The 
separated samples were then electrotransferred to a 
nitrocellulose membrane; one part of the membrane was 
stained with Ponceau Red to localise the protein 
material and the remainder incubated with 1% ovalbumin 
in TST buffer (Tris 10 mM, NaCl 150 mM, Tween 0.05%) 
to block protein binding sites. 

After blocking, the membrane was extensively washed 
with TST buffer and incubated for 2 hours with 
35 S-labelled trypanosome antigen. After extensive 
washing, the membrane was dried and analysed by 
autoradiography. To avoid background and unspecific 
binding, the labelled trypanosome lysate was filtered 
through a 0.45 µm Millipore filter and incubated with 
healthy camel immunoglobulin and ovalbumin adsorbed on 
a nitrocellulose membrane. 

Figure 4 : IgG3 of the camel, purified by 
affinity chromatography on Protein A 
Sepharose, is partially digested 
with papain and separated on Protein 
A Sepharose. 

14 mg of purified IgG3 were dissolved in 0.1 M phosphate 
buffer pH 7.0 containing 2 mM EDTA. They were digested 
by 1 hour incubation at 37°C with mercuripapain (1% 
enzyme to protein ratio) activated by 5 × 10^-4 M 
cysteine. The digestion was blocked by the addition 
of excess iodoacetamide (4 × 10^-2 M) (13). After 
centrifugation of the digest in an Eppendorf centrifuge 
for 5 min at 15000 rpm, the papain fragments were 
separated on a Protein A Sepharose column into binding 
(B) and non binding (NB) fractions. The binding 
fraction was eluted from the column with 0.1 M 
glycine-HCl buffer pH 1.7. 

Figure 5 : Schematic presentation of a model for 
IgG3 molecules devoid of light chains. 

Figure 6 : Schematic representation of 
immunoglobulins having heavy polypeptide 
chains and devoid of light chains, 
regarding the conventional four-chain model 
immunoglobulin. Representation of a hinge 
region. 

Figure 7 : Alignment of 17 V HH DNA sequences of 
camel heavy chain immunoglobulins 

Figure 8 : Expression and purification of the 
camel V HH 21 protein from E. coli 

I HEAVY CHAIN ANTIBODIES IN CAMELIDS 

When Camelus dromedarius serum is adsorbed on 
Protein G Sepharose, an appreciable amount (25-35%) of 
immunoglobulins (Ig) remains in solution, which can 
then be recovered by affinity chromatography on 
Protein A Sepharose (fig. 1A). The fraction adsorbed 
on Protein G can be differentially eluted into a 
tightly bound fraction (25%) consisting of molecules 
of an unreduced apparent molecular weight (MW) of 170 
Kd and a more weakly bound fraction (30-45%) having an 
apparent molecular weight of 100 Kd (fig. 1B). The 
170 Kd component when reduced yields 50 Kd heavy 
chains and large 30 Kd light chains. The 100 Kd 
fraction is totally devoid of light chains and appears 
to be solely composed of heavy chains which after 
reduction have an apparent MW of 43 Kd (fig. 1C). The 
fraction which does not bind to Protein G can be 
affinity purified and eluted from a Protein A column 
as a second 100 Kd component which after reduction 
appears to be composed solely of 46 Kd heavy chains. 
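
The chain compositions described above can be cross-checked against the apparent molecular weights with simple arithmetic. The sketch below is only a consistency check under the assumption that apparent SDS-PAGE weights are roughly additive; the fraction labels and function names are illustrative, and the conclusion that light chains are absent rests on the gels themselves, not on this calculation.

```python
# Apparent molecular weights (Kd) quoted in the text.
LIGHT_CHAIN = 30

# (fraction label, unreduced apparent MW, reduced heavy chain MW)
FRACTIONS = [
    ("Protein G, tightly bound", 170, 50),
    ("Protein G, weakly bound", 100, 43),
    ("Protein A only", 100, 46),
]


def four_chain_mass(heavy: int, light: int = LIGHT_CHAIN) -> int:
    """Two heavy plus two light chains (conventional four-chain model)."""
    return 2 * heavy + 2 * light


def heavy_dimer_mass(heavy: int) -> int:
    """Two heavy chains and no light chain."""
    return 2 * heavy


def best_model(observed: int, heavy: int) -> str:
    """Pick the model whose summed chain mass is closest to the observed MW."""
    candidates = {
        "H2L2": four_chain_mass(heavy),
        "H2": heavy_dimer_mass(heavy),
    }
    return min(candidates, key=lambda name: abs(candidates[name] - observed))


results = {label: best_model(mw, heavy) for label, mw, heavy in FRACTIONS}
for label, model in results.items():
    print(f"{label}: {model}")
```

With these figures the 170 Kd component is closest to a four-chain assembly (2 x 50 + 2 x 30 = 160 Kd), while both 100 Kd components are closest to a heavy chain dimer (86 Kd and 92 Kd respectively).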

The heavy chain immunoglobulins devoid of light 
chains total up to 75% of the molecules binding to 
Protein A. 

As all three immunoglobulins bind to Protein A we 
refer to them as IgG: namely IgG1 (light chain and 
heavy chain γ1 (50 Kd)) binding to Protein G, IgG2 
(heavy chain γ2 (46 Kd)) non binding to Protein G and 
IgG3 (heavy chain γ3 (43 Kd)) binding to Protein G. 
There is a possibility that these three sub(classes) 
can be further subdivided. 

A comparative study of old world camelids 
(Camelus bactrianus and Camelus dromedarius) and new 
world camelids (Lama pacos, Lama glama, Lama vicugna) 
showed that heavy chain immunoglobulins are found in 
all species examined, albeit with minor differences in 
apparent molecular weight and proportion. The new 
world camelids differ from the old world camelids in 
having a larger IgG3 molecule (heavy chain 
immunoglobulin binding to Protein G) in which the 
constituent heavy chains have an apparent molecular 
weight of 47 Kd (fig. 2). 

The abundance of the heavy chain immunoglobulins 
in the serum of camelids raises the question of what 
their role is in the immune response and in particular 
whether they bear antigen binding specificity and if 
so how extensive is the repertoire. This question 
could be answered by examining the immunoglobulins 
from Trypanosoma evansi infected camels (Camelus 
dromedarius). 

For this purpose, the corresponding fractions of 
IgG1, IgG2 and IgG3 were prepared from the serum of a 
healthy camel and from the serum of camels with a high 
antitrypanosome titer, measured by the Card 
Agglutination Test (3). In radio-immunoprecipitation, 
IgG1, IgG2 and IgG3 derived from infected camels 
were shown to bind a large number of antigens present 
in a 35S methionine labelled trypanosome lysate, 
indicating extensive repertoire heterogeneity and 
complexity (Fig. 3A). 

In blotting experiments 35S methionine labelled 
trypanosome lysate binds to SDS PAGE separated IgG1, 
IgG2 and IgG3 obtained from infected animals (Fig. 
3B). 

This leads us to conclude that the camelid heavy 
chain IgG2 and IgG3 are bona fide antigen binding 
antibodies. 

An immunological paradigm states that an 
extensive antibody repertoire is generated by the 
combination of the light and heavy chain variable V 
region repertoires (6). The heavy chain 
immunoglobulins of the camel seem to contradict this 
paradigm. 

Immunoglobulins are characterized by a complex 
I.E.F. (isoelectric focussing) pattern reflecting 
their extreme heterogeneity. To determine whether the 
two heavy chains constituting the IgG2 and IgG3 are 
identical or not, the isoelectric focussing (I.E.F.) 
patterns were observed before and after chain 
separation by reduction and alkylation using 
iodoacetamide as alkylating agent. 

As this alkylating agent does not introduce 
additional charges in the molecule, the monomers 
resulting from the reduction and alkylation of a heavy 
chain homodimer will have practically the same 
isoelectric point as the dimer, whereas if they are 
derived from a heavy chain heterodimer, the monomers 
will in most cases differ sufficiently in isoelectric 
point to generate a different pattern in I.E.F. 

Upon reduction and alkylation by iodoacetamide, 
the observed pattern is not modified for the Camelus 
dromedarius IgG2 and IgG3, indicating that these 
molecules are each composed of two identical heavy 
chains which migrate to the same position as the 
unreduced molecule they originated from. 

In contrast, the I.E.F. pattern of IgG1 is 
completely modified after reduction as the isoelectric 
point of each molecule is determined by the 
combination of the isoelectric points of the light and 
heavy chains which after separation will each migrate 
to a different position. 

These findings indicate that the heavy chains 
alone can generate an extensive repertoire and 
question the contribution of the light chain to the 
useful antibody repertoire. If this necessity is 
negated, what other role does the light chain play? 

Normally, isolated heavy chains from mammalian 
immunoglobulins tend to aggregate considerably and are 
only solubilized by light chains (8, 9) which bind to 
the CH1 domain of the heavy chain. 

In humans and in mice a number of spontaneous or 
induced myelomas produce a pathological immunoglobulin 
solely composed of heavy chains (heavy chain disease). 
These myeloma protein heavy chains carry deletions in 
the CH1 and VHH domains (10). The reason why full 
length heavy chains do not give rise to secreted heavy 
chain in such pathological immunoglobulins seems to 
stem from the fact that the synthesis of Ig involves a 
chaperoning protein, the immunoglobulin heavy chain 
binding protein or BIP (11), which normally is 
replaced by the light chain (12). It is possible that 
the primordial role of the light chain in the 
four-chain model immunoglobulins is that of a committed 
heavy chain chaperone and that the emergence of light 
chain repertoires has just been an evolutionary bonus. 

The camelid γ2 and γ3 chains are considerably 
shorter than the normal mammalian γ chain. This would 
suggest that deletions have occurred in the CH1 
domain. Differences in sizes of the γ2 and γ3 
immunoglobulins of old and new world camelids suggest 
that deletions occurred in several evolutionary steps 
especially in the CH1 domain. 

II THE HEAVY CHAIN IMMUNOGLOBULINS OF THE CAMELIDS 
LACK THE CH1 DOMAIN. 

The strategy followed for investigating the heavy 
chain immunoglobulin primary structure is a 
combination of protein and cDNA sequencing; the 
protein sequencing is necessary to identify sequence 
stretches characteristic of each immunoglobulin. The 
N-terminal of the immunoglobulin being derived from 
the heavy chain variable region repertoire only yields 
information on the VHH subgroups (variable region of 
the heavy chain) and cannot be used for class or 
subclass identification. This means that sequence data 
had to be obtained from internal enzymatic or chemical 
cleavage sites. 

A combination of papain digestion and Protein A 
affinity chromatography allowed the separation of 
various fragments yielding information on the general 
structure of IgG3. 

The IgG3 of the camel (Camelus dromedarius) 
purified by affinity chromatography on Protein A 
Sepharose were partially digested with papain and the 
digest was separated on Protein A Sepharose into 
binding and non binding fractions. These fractions 
were analysed by SDS PAGE under reducing and non 
reducing conditions (fig 4). 

The bound fraction contained two components, one 
of 28 Kd and one of 14.4 Kd, in addition to uncleaved 
or partially cleaved material. They were well 
separated by gel electrophoresis (from preparative 
19% SDS-PAGE gels) under non reducing conditions and 
were further purified by electroelution (in 50nM 
ammonium bicarbonate, 0.1% (w/v) SDS using a BioRad 
electro-eluter). After lyophilization of these 
electroeluted fractions, the remaining SDS was 
eliminated by precipitating the protein by the 
addition of 90% ethanol, mixing and incubating the 
mixture overnight at -20 °C (14). The precipitated 
protein was collected in a pellet by centrifuging 
(15000 rpm, 5 min) and was used for protein sequencing. 

N-terminal sequencing was performed using the 
automated Edman chemistry of an Applied Biosystems 477A 
pulsed liquid protein sequencer. Amino acids were 
identified as their phenylthiohydantoin (PTH) 
derivatives using an Applied Biosystems 120 PTH 
analyser. All chemicals and reagents were purchased 
from Applied Biosystems. Analysis of the 
chromatographic data was performed using Applied 
Biosystems software version 1.61. In every case the 
computer aided sequence analysis was confirmed by 
direct inspection of the chromatograms from the PTH 
analyser. Samples for protein sequencing were 
dissolved in either 50% (v/v) trifluoroacetic 
acid (TFA) (28 Kd fragment) or 100% TFA (14 Kd fragment). 
Samples of dissolved protein equivalent to 2000 pmol 
(28 Kd fragment) or 500 pmol (14 Kd fragment) were 
applied to TFA-treated glass fibre discs. The glass 
fibre discs were coated with BioBrene (3 mg) and 
precycled once before use. 

N-terminal sequencing of the 28 Kd fragment 
yields a sequence homologous to the N-terminal part of 
the γ CH2 domain and hence to the N-terminal end of the Fc 
fragment. The N-terminal sequence of the 14.4 Kd 
fragment corresponds to the last lysine of a γ CH2 and 
the N-terminal end of a γ CH3 domain (Table 1). The 
molecular weight (MW) of the papain fragments and the 
identification of their N-terminal sequences led us to 
conclude that the CH2 and CH3 domains of the γ3 heavy 
chains are normal in size and that the deletion must 
occur either in the CH1 or in the VHH domain to 
generate the shortened γ3 chain. The fractions which do 
not bind to Protein A Sepharose contain two bands of 
34 and 17 Kd which are more diffuse in SDS PAGE, 
indicating that they originate from the variable 
N-terminal part of the molecule (fig 4). 

Upon reduction, a single diffuse band of 17 Kd is 
found indicating that the 34 Kd is a disulfide bonded 
dimer of the 17 Kd component. The 34 Kd fragment 
apparently contains the hinge and the N-terminal 
domain VHH. 

The protein sequence data can be used to 
construct degenerate oligonucleotide primers allowing 
PCR amplification of cDNA or genomic DNA. 

It has been shown that cells from camel 
spleen imprints reacted with rabbit anti 
camel immunoglobulin sera and that the spleen was 
hence a site of synthesis of at least one 
immunoglobulin class. cDNA was therefore synthesized 
from camel spleen mRNA. The conditions for the 
isolation of RNA were the following: total RNA was 
isolated from the dromedary spleen by the guanidinium 
isothiocyanate method (15). mRNA was purified with 
oligo T-paramagnetic beads. 

cDNA synthesis is obtained using 1 µg mRNA template, an 
oligo dT primer and reverse transcriptase (Boehringer 
Mannheim). Second strand cDNA is obtained using RNAse H and 
E. coli DNA polymerase I according to the conditions 
given by the supplier. 

Relevant sequences were amplified by PCR: 5 ng of cDNA 
was amplified by PCR in a 100 µl reaction mixture 
(10mM Tris-HCl pH 8.3, 50mM KCl, 15mM MgCl2, 0.01% 
(w/v) gelatine, 200 µM of each dNTP and 25 pmoles of 
each primer) overlaid with mineral oil (Sigma). 
Degenerate primers containing EcoRI and KpnI sites were 
used and the products were further cloned into pUC 18. 
After a round of denaturing and annealing (94 °C for 
5 min and 54 °C for 5 min), 2 units of Taq DNA 
polymerase were added to the reaction mixture before 
subjecting it to 35 cycles of amplification: 1 min at 
94 °C (denature), 1 min at 54 °C (anneal), 2 min at 
72 °C (elongate). To amplify DNA sequences between VHH 
and CH2 domains (#72 clones), the PCR was performed in 
the same conditions with the exception that the 
annealing temperature was increased to 60 °C. 
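
The thermal profile described above can be written down as a small data structure, which makes the total programmed time and the single change between the two clone series explicit. This is a minimal sketch under stated assumptions: the step names and helper functions are hypothetical, the denaturation and annealing holds in each cycle are read as 1 min, and ramp times between temperatures are ignored.

```python
# Thermal profile as read from the protocol above: (step, degrees C, minutes).
# The 1 min denaturation and annealing holds are an assumed reading of the
# source text; adjust them if the original values differ.
INITIAL_STEPS = [("denature", 94, 5), ("anneal", 54, 5)]
CYCLE_STEPS = [("denature", 94, 1), ("anneal", 54, 1), ("elongate", 72, 2)]
N_CYCLES = 35


def program_minutes(initial, cycle, n_cycles):
    """Total programmed hold time in minutes, ignoring ramp times."""
    initial_time = sum(minutes for _, _, minutes in initial)
    cycle_time = sum(minutes for _, _, minutes in cycle)
    return initial_time + n_cycles * cycle_time


def with_annealing(steps, temperature):
    """Return a copy of the steps with a different annealing temperature."""
    return [(name, temperature if name == "anneal" else temp, minutes)
            for name, temp, minutes in steps]


total = program_minutes(INITIAL_STEPS, CYCLE_STEPS, N_CYCLES)
print(f"Series #56 program: {total} min of programmed holds")
print("Series #72 cycle:", with_annealing(CYCLE_STEPS, 60))
```

Under these assumptions the holds add up to two and a half hours, and the series #72 program differs from the series #56 program only in the annealing temperature.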

One clone examined (#56/36) had a sequence 
corresponding to the N-terminal part of a CH2 domain 
identical to the sequence of the 28 Kd fragment. The 
availability of this sequence data allowed the 
construction of an exact 3' primer and the cloning of 
the region between the N-terminal end of the VHH and 
the CH2 domain. 

5' primers corresponding to the mouse VHH (16) 
and containing a XhoI restriction site were used in 
conjunction with the 3' primer in which a KpnI site 
had been inserted and the amplified sequences were 
cloned into pBluescript®. Clone #56/36 which displayed 
two internal HaeIII sites was digested with this 
enzyme to produce a probe to identify PCR positive 
clones. 

After amplification the PCR products were checked 
on a 1.2% (w/v) agarose gel. Cleaning up of the PCR 
products included a phenol-chloroform extraction 
followed by further purification by HPLC (GEN-PAC FAX 
column, Waters) and finally by using the MERMAID or 
GENECLEAN II kit (BIO 101, Inc) as appropriate. After 
these purification steps, the amplified cDNA was then 
digested with EcoRI and KpnI for series #56 clones and 
with XhoI and KpnI for series #72 clones. A final 
phenol-chloroform extraction preceded the ligation 
into pUC 18 (series #56 clones) or into pBluescript® 
(series #72 clones). 

All the clones obtained were smaller than the 
860 base pairs to be expected if they possessed a 
complete VHH and CH1 region. Partial sequence data 
corresponding to the N-terminal of the VHH region 
reveals that out of 20 clones, 3 were identical and 
possibly not independent. The sequences obtained 
resemble the human subgroup III and the murine 
subgroups IIIa and IIIb (Table 2). 

Clones corresponding to two different sets of CH2 
protein sequences were obtained. A first set of 
sequences (#72/41) had a N-terminal CH2 region 
identical to the one obtained by protein sequencing of 
the 28 Kd papain fragments of the γ3 heavy chain, a 
short hinge region containing 3 cysteines and a 
variable region corresponding to the framework (FR4) 
residues encoded by the J minigenes adjoining the 
hinge. The CH1 domain is entirely lacking. This cDNA 
corresponds to the γ3 chain (Table 4). 

In one closely related sequence (#72/1) the 
proline in position 259 is replaced by threonine. 

The sequence corresponding to the CH3 and the 
remaining part of the CH2 was obtained by PCR of the 
cDNA using as KpnI primer a poly T in which a KpnI 
restriction site had been inserted at the 5' end. The 
total sequence of the γ3 chain corresponds to a 
molecular weight (MW) which is in good agreement with 
the data obtained from SDS PAGE electrophoresis. 

The sequence of this γ3 chain presents 
similarities with other γ chains except that it lacks 
the CH1 domain, the VHH domain being adjacent to the 
hinge. 

One or all three of the cysteines could 
probably be responsible for holding the two γ3 chains 
together. 

These results have allowed us to define a model 
for the IgG3 molecule based on sequence and papain 
cleavage (fig. 5). 

Papain can cleave the molecule on each side of 
the hinge disulfides and also between CH2 and CH3. 
Under non reducing conditions the VHH domains of IgG3 
can be isolated as disulfide linked dimer or as 
monomer depending on the site of papain cleavage. 

A second set of clones #72/29 had a slightly 
different sequence for the CH2 and was characterized 
by a very long hinge immediately preceded by the 
variable domain. This hinge region has 3 cysteines at 
its C-terminal end in a sequence homologous to the γ3 
hinge. Such second set of clones could represent the 
IgG2 subclass. For the constant part of the γ3 and 
also for the putative γ2, most clones are identical 
showing the γ2 or γ3 specific sequences. A few clones 
such as #72/1 however show minor differences. For 
instance in the case of clone #72/1 two nucleotide 
differences are detected. 

Several VHH region cDNAs have now been totally 
or partially sequenced with the exception of a short 
stretch at the N-terminal end which is primer derived. 
Upon translation the majority shows the 
characteristic heavy chain Ser21 Cys22 and Tyr90 Tyr91 
Cys92 sequences of the intra VHH region disulfide 
bridge linking residues 22 and 92. All these clones 
have a sequence corresponding to the framework 4 (FR4) 
residues of the variable region immediately preceding 
the postulated hinge sequence (Table 3). This sequence 
is generated by the J minigenes and is in the majority 
of cases similar to the sequence encoded by the human 
and murine J minigenes. The sequence length between 
Cys92 and the C-terminal end of the VHH regions 
is variable and, in the sequences determined, ranges 
from 25 to 37 amino-acids as one might expect from the 
rearrangements of J and D minigenes varying in length. 
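
The similarity between the camel framework 4 residues and the human and murine J minigene sequences can be quantified by counting mismatched positions. The sketch below uses the framework 4 sequences listed in the comparison table of this document; the helper names are hypothetical, and the mismatch count is only one simple way of expressing similarity.

```python
# Framework 4 sequences as listed in the comparison table of this document.
HUMAN_J = {"J1/J4/J5": "WGQGTLVTVSS", "J2": "WGRGTLVTVSS",
           "J6": "WGQGTTVTVSS", "J3": "WGQGTMVTVSS"}
MURINE_J = {"J1": "WGQGTTLTVSS", "J2": "WGQGTLVTVSS",
            "J3": "WGQGTSVTVSA", "J4": "WGAGTTVTVSS"}
CAMEL_FR4 = ["WGQGTQVTVSS", "WGQGTLVTVSS", "WGRGTQVTVSS",
             "WGQGTHVTVSS", "WGQGIQVTASS"]


def mismatches(a, b):
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def closest(query, references):
    """Return (name, mismatch count) of the closest reference sequence."""
    scored = ((name, mismatches(query, seq)) for name, seq in references.items())
    return min(scored, key=lambda item: item[1])


for fr4 in CAMEL_FR4:
    print(fr4, "human:", closest(fr4, HUMAN_J), "murine:", closest(fr4, MURINE_J))
```

With these sequences, the most frequent camel framework 4 differs from its closest human J sequence at a single position out of eleven, and one camel clone matches a human and a murine sequence exactly.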

Several important questions are raised by the 
sole existence of these heavy chain immunoglobulins in 
a non pathological situation. First of all, are they 
bona fide antibodies? The heavy chain immunoglobulins 
obtained from trypanosome infected camels react with a 
large number of parasite antigens as shown in part I 
of these examples. This implies that the camelid 
immune system generates an extensive number of binding 
sites composed of single VHH domains. This is 
confirmed by the diversity of the VHH regions of the 
heavy chain immunoglobulins obtained by PCR. 

The second question is "how are they secreted?". 
The secretion of immunoglobulin heavy chains composing 
four-chain model immunoglobulins does not occur under 
normal conditions. A chaperoning protein, the heavy 
chain binding protein, or BIP protein, prevents heavy 
chains from being secreted. It is only when the light 
chain displaces the BIP protein in the endoplasmic 
reticulum that secretion can occur (13). 

The heavy chain dimers found in the serum of humans 
or mice with the so-called "heavy chain disease" lack 
the CH1 domains thought to harbour the BIP site 
(14). In the absence of this domain the BIP protein can 
no longer bind and prevent the transport of the heavy 
chains. 

The presence in camels of an IgG1 class composed 
of heavy and light chains making up between 25% and 
50% of the total IgG molecules also raises the problem 
as to how maturation and class switching occur and 
what the role of the light chain is. The camelid light 
chain appears unusually large and heterogeneous when 
examined in SDS PAGE. 

The largest dimension of an isolated domain is 
40 Å and the maximum attainable span between binding 
sites of a conventional IgG with CH1 and VHH will be of 
the order of 160 Å (2 VHH + 2 CH1) (19). The deletion of 
the CH1 domain in the two types of heavy chain antibodies 
devoid of light chains already sequenced results in a 
modification of this maximum span (fig. 6). 
In the IgG3 the extreme distance between the 
extremities of the VHH regions will be of the order of 
80 Å (2 VHH). This could be a severe limitation for 
agglutinating or cross linking. In the IgG2 this is 
compensated by the extremely long stretch of hinge, 
composed of a 12-fold repeat of the sequence Pro-X 
(where X is Gln, Lys or Glu) and located N-terminal to 
the hinge disulfide bridges. In contrast, in the human 
IgG3, the very long hinge which also apparently arose 
as the result of sequence duplication does not 
contribute to increase the distance spanning the two 
binding sites as this hinge is interspersed with 
disulfide bridges. 
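
Both the span estimates and the Pro-X repeat count in this paragraph can be reproduced with a few lines of code. The following is a minimal sketch, assuming that each domain contributes its largest dimension (40 Å) to the span and using the long hinge sequence as it is listed in the claims of this document; the function names are hypothetical.

```python
import re

DOMAIN_SPAN_ANGSTROM = 40  # largest dimension of an isolated domain (from the text)

# Long hinge sequence as listed in the claims of this document.
LONG_HINGE = "EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP"


def max_span(domains_between_sites):
    """Upper bound on the span between binding sites, in angstroms."""
    return domains_between_sites * DOMAIN_SPAN_ANGSTROM


def longest_pro_x_run(sequence, allowed="QKE"):
    """Longest run of consecutive Pro-X dipeptides with X in `allowed`."""
    runs = re.findall(f"(?:P[{allowed}])+", sequence)
    return max((len(run) // 2 for run in runs), default=0)


print("four-chain IgG (2 VHH + 2 CH1):", max_span(4), "angstroms")
print("IgG3 without CH1 (2 VHH):", max_span(2), "angstroms")
print("Pro-X repeats in long hinge:", longest_pro_x_run(LONG_HINGE))
print("cysteines in long hinge:", LONG_HINGE.count("C"))
```

The repeat search treats only uninterrupted Pro-X dipeptides as one run, so the three cysteines at the C-terminal end of the hinge fall outside the repeat, in line with the description of the repeat as N-terminal to the hinge disulfide bridges.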

The single VHH domain could also probably allow 
considerable rotational freedom of the binding site 
versus the Fc domain. 

Unlike myeloma heavy chains which result probably 
from CH1 deletion in a single antibody producing cell, 
or heavy chain antibodies produced by expression 
cloning (15), the camelid heavy chain antibodies 
(devoid of light chains) have emerged in a normal 
immunological environment and it is expected that they 
will have undergone the selective refinement in 
specificity and affinity accompanying B cell 
maturation. 

Expression and purification of the camel VHH21 (DR21 
on figure 7) protein from E. coli 

The clones can be expressed in several types of 
expression vectors. As an example using a 
commercially available vector Immuno PBS (Huse et al : 
Science (1989) 246, 1275), clones produced in 
Bluescript® according to the above described 
procedure, have been recovered by PCR using the same 
XhoI containing 5' primer and a new 3' primer, 
corresponding to residues 113-103 in the framework of 
the immunoglobulins, in which an Spe site has been 
constructed : TC TTA ACT AGT GAG GAG ACG GTG ACC TG. 
This procedure allowed the cloning of the VHH in the 
Xho/Spe site of the Immuno PBS vector. However, the 
3' end of the gene was not in phase with the 
identification "tag" and the stop codon of the vector. 
To achieve this, the construct was cut with Spe and 
the 4 base overhangs were filled in, using the Klenow 
fragment after which the vector was religated. 
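
The 3' primer quoted above is written as the reverse strand, so its relation to the coding sequence is easier to see after taking the reverse complement. The sketch below is illustrative: it locates the Spe recognition sequence (ACTAGT) in the primer and translates the reverse complement with the standard genetic code. The reading frame used (starting at the first base of the reverse complement) is an assumption made for illustration, and the function names are hypothetical.

```python
PRIMER_3 = "TCTTAACTAGTGAGGAGACGGTGACCTG"  # 3' primer from the text, spaces removed
SPE_SITE = "ACTAGT"  # Spe recognition sequence

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna):
    """Reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate complete codons from the first base; a trailing partial codon is dropped."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


coding = reverse_complement(PRIMER_3)
print("coding strand:", coding)
print("translation:", translate(coding))
print("Spe site offset in primer:", PRIMER_3.find(SPE_SITE))
```

In this frame the coding strand begins with codons for Gln-Val-Thr-Val-Ser-Ser, which matches the C-terminal residues of the camel framework 4 sequences, followed directly by the Spe site.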

The expression vector plasmid ipBS (immunopBS) 
(Stratacyte) contains a pel B leader sequence which is 
used for immunoglobulin chain expression in E. coli 
under the promoter pLAC control, a ribosome binding 
site, and stop codons. In addition, it contains a 
sequence for a C-terminal decapeptide tag. 

E. coli JM101 harboring the ipBS-VHH21 plasmid was 
grown in 1 l of TB medium with 100 µg/ml ampicillin 
and 0.1 % glucose at 32 °C. Expression was induced by 
the addition of 1 mM IPTG (final concentration) at an 
OD550 of 1.0. After overnight induction at 28 °C, the 
cells were harvested by centrifugation at 4000 g for 
10 min (4 °C) and resuspended in 10 ml TES buffer (0.2 
M Tris-HCl pH 8.0, 0.5 mM EDTA, 0.5 M sucrose). The 
suspension was kept on ice for 2 hours. Periplasmic 
proteins were removed by osmotic shock by addition of 
20 ml TES buffer diluted 1:4 v/v with water, kept on 
ice for one hour and subsequently centrifuged at 
12000 g for 30 min at 4 °C. The supernatant 
periplasmic fraction was dialysed against Tris-HCl pH 
8.8, NaCl 50mM, applied on a fast Q Sepharose flow 
(Pharmacia) column, washed with the above buffer 
and eluted with a linear gradient of 50 mM to 1 M NaCl 
in buffer. 

Fractions containing the VHH protein were further 
purified on a Superdex 75 column (Pharmacia) 
equilibrated with PBS buffer (0.01 M phosphate pH 7.2, 
0.15 M NaCl). The yield of purified VHH protein varies 
from 2 to 5 mg/l cell culture. 

- Fractions were analyzed by SDS-PAGE (I). Positive 
identification of the camel VHH antibody fragment was 
done by Western Blot analysis using antibody raised in 
rabbits against purified camel IgGH3 and an anti-rabbit 
IgG-alkaline phosphatase conjugate (II). 

As protein standards (Pharmacia) periplasmic 
proteins prepared from 1 ml of IPTG-induced JM101/ipBS 
VHH21 were used. Figure 8 shows: C, D: fractions from 
fast S Sepharose column chromatography (C: eluted at 
650 mM NaCl, D: eluted at 700 mM NaCl); E, F: fractions 
from Superdex 75 column chromatography. 

As can be seen, the major impurity is eliminated 
by ion exchange chromatography and the bulk of the 
remaining impurities are eliminated by gel filtration. 

            Framework 4      J Genes 

Human       WGQGTLVTVSS      J1, J4, J5 
            WGRGTLVTVSS      J2 
            WGQGTTVTVSS      J6 
            WGQGTMVTVSS      J3 

Murine      WGQGTTLTVSS      J1 
            WGQGTLVTVSS      J2 
            WGQGTSVTVSA      J3 
            WGAGTTVTVSS      J4 

                             cDNA Clones 

Camel       WGQGTQVTVSS      Clones 
            WGQGTQVTVSS      #72/19 = #72/3 
            WGQGTLVTVSS      1 Clone 
            WGRGTQVTVSS      #72/24 
            WGQGTHVTVSS      #72/21 
            WGQGIQVTASS      #72/16 

Table * 

Comparison of some Framework 4 residues found in the Camel VHH 
region with the Framework 4 residues corresponding to the 
consensus region of the Human and Mouse J minigenes. 

CLAIMS 

1. Immunoglobulin characterized in that it comprises 
two heavy polypeptide chains sufficient for the 
formation of a complete antigen binding site or 
several antigen binding sites, this immunoglobulin 
being further devoid of light polypeptide chains. 

2. Immunoglobulin according to claim 1, 
characterized in that it comprises two heavy 
polypeptide chains sufficient for the formation of a 
complete antigen binding site or several antigen 
binding sites, this immunoglobulin being further 
devoid of light polypeptide chains and further 
characterized by the fact that it is the product of 
the expression in a prokaryotic or in a eukaryotic 
host cell, of a DNA or of a cDNA having the sequence 
of an immunoglobulin devoid of light chains as 
obtainable from lymphocytes or other cells of 
Camelids. 

3. Immunoglobulin according to claim 1 or claim 2, 
characterized in that the amino acid sequence of its 
variable region contains in position 45 an amino acid 
which is different from a leucine, or proline or 
glutamine residue . 

4. Immunoglobulin according to any one of claims 1 to 
3, characterized in that its heavy polypeptide chains 
are devoid of a so-called first domain in their 
constant region (CH1). 

5. Immunoglobulin according to any one of claims 1 to 
4, characterized in that it comprises an antigen 
binding site or several antigen binding sites and 
especially in that each variable region of each heavy 
chain contains at least one antigen binding site. 

6. Immunoglobulin according to any one of claims 1 to 
5, characterized in that it is a type G immunoglobulin 
of class 2 (IgG2) or it is a type G immunoglobulin of 
class 3 (IgG3). 

7. Immunoglobulin according to any one of claims 1 to 
6, characterized in that it is a Camelid immunoglobulin. 

8. Immunoglobulin according to any one of claims 1 to 
7 as obtainable by purification from the serum of 
Camelids, characterized in that : 

it is not adsorbed by chromatography on Protein G 
Sepharose column, 

it is adsorbed by chromatography on Protein A 
Sepharose column, 

it has a molecular weight of around 100 Kd after 
elution with a pH 4.5 buffer (0.15 M NaCl, 0.58% 
acetic acid adjusted to pH 4.5 by NaOH), 

it consists of heavy γ2 polypeptide chains of a 
molecular weight of around 45 Kd preferably 46 Kd 
after reduction. 

9. Immunoglobulin according to any one of claims 1 to 
7, as obtainable by purification from the serum of 
Camelids, characterized in that the immunoglobulin : 

is adsorbed by chromatography on a Protein A 
Sepharose column, 

has a molecular weight of around 100 Kd after 
elution with a pH 3.5 buffer (0.15 M NaCl, 0.58% 
acetic acid), 

is adsorbed by chromatography on a Protein G 
Sepharose column and eluted with pH 3.5 buffer 
(0.15 M NaCl, 0.58% acetic acid), 

consists of heavy γ3 polypeptide chains of a 
molecular weight of around 45 Kd in particular 
between 43 and 47 Kd after reduction. 

10. Immunoglobulin according to any one of claims 1 to 
9, characterized in that 

it comprises 4 frameworks in its variable region, 
which frameworks comprise an amino-acid sequence 
selected from the following sequences : 

for the framework 1 domain 

GGSVQTGGSLRLSCEISGLTFD 
GGSVQTGGSLRLSCAVSGFSFS 
GGSEQGGGSLRLSCAISGYTYG 
GGSVQPGGSLTLSCTVSGATYS 
GGSVQAGGSLRLSCTGSGFPYS 
GGSVQAGGSLRLSCVAGFGTS 
GGSVQAGGSLRLSCVSFSPSS 

for the framework 4 domain 

WGQGTQVTVSS 

WGQGTLVTVSS 

WGQGAQVTVSS 

WGQGTQVTASS 

RGQGTQVTVSL 

and/or, 

for the CDR3 domain 

ALQPGGYCGYGX----------CL 
VSLMDRISQH------------GC 
VPAHLGPGAILDLKKY------KY 
FCYSTAGDGGSGE---------MY 
ELSGGSCELPLLF---------DY 
DWKYWTCGAQTGGYF-------GQ 
RLTEMGACDARWATLATRTFAYNY 
QKKDRTRWAEPREW--------NN 
GSRFSSPVGSTSRLES-SDY--NY 
ADPSIYYSILXIEY--------KY 
DSPCYMPTMPAPPIRDSFGW--DD 
TSSFYWYCTTAPY---------NV 
TEIEWYGCNLRTTF--------TR 
NQLAGGWYLDPNYWLSVGAY--AI 
RLTEMGACDARWATLATRTFAYNY 
DGWTRKEGGIGLPWSVQCEDGYNY 
DSYPCHLL--------------DV 
VEYPIADKCS------------RY 



and/or, 

in that its constant region comprises CH2 and CH3 
domains comprising an amino-acid sequence 
selected from the following sequences : 

for the CH2 domain: 

APELLGGPTVFIFPPKPKDVLSITLTP 

APELPGGPSVFVFPTKPKDVLSISGRP 

APELPGGPSVFVFPPKPKDVLSISGRP 

APELLGGPSVFIFPPKPKDVLSISGRP 

for the CH3 domain: 

GQTREPQVYTLA 

GQTREPQVYTLAPXRLEL 

GQPREPQVYTLPPSRDEL 

GQPREPQVYTLPPSREEM 

GQPREPQVYTLPPSQEEM 

and/or, 

in that its hinge region comprises from 0 to 50 
amino-acids, especially in that its hinge region 
comprises an amino-acid sequence selected from 
the following sequences : 

GTNEVCKCPKCP 

or, 

EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP 

11. Immunoglobulin according to any one of claims 1 to 
10, characterized in that it is encoded by a sequence 
selected among those represented on figure 7. 

12. Fragment of an immunoglobulin according to any one 
of claims 1 to 11, characterized in that it is 
selected from the following group : 

a fragment corresponding to one heavy polypeptide 
chain of an immunoglobulin devoid of light 
chains, 

fragments obtained by enzymatic digestion of the 
immunoglobulins of the invention, especially 
those obtained by partial digestion with papain 
leading to the Fc fragment (constant fragment) 
and leading to FVHHh fragment (containing the 
antigen binding sites of the heavy chains) or its 
dimer F(VHHh)2, or a fragment obtained by further 
digestion with papain of the Fc fragment, leading 
to the Fc' fragment corresponding to the 
C-terminal part of the Fc fragment, 

homologous fragments obtained with other 
proteolytic enzymes, 

a fragment of at least 10 preferably 20 amino 
acids of the variable region of the 
immunoglobulin, or the complete variable region, 
especially a fragment corresponding to the 
isolated VHH domains or to the VHH dimers linked 
to the hinge disulphide, 

a fragment corresponding to the hinge region of 
the immunoglobulin, or to at least 6 amino acids 
of this hinge region, 

a fragment of the hinge region comprising a 
repeated sequence of Pro-X, 

a fragment corresponding to at least 10 
preferably 20 amino acids of the constant region 
or to the complete constant region of the 
immunoglobulin. 

13. Immunoglobulin according to any one of claims 1 to 
12, characterized in that all or a part of its 
constant region is replaced by all or part of the 
constant region of a human antibody. 

14. Immunoglobulin according to any one of claims 1 
to 13, obtainable in prokaryotic cells, especially in 
E. coli cells by a process comprising the steps of : 

a) cloning in a Bluescript vector of a DNA or cDNA 
sequence coding for the VH domain of an 
immunoglobulin devoid of light chain obtainable 
for instance from lymphocytes of Camelids, 

b) recovering the cloned fragment after 
amplification using a 5' primer containing an Xho 
site and a 3' primer containing the Spe site 
having the following sequence 
TC TTA ACT AGT GAG GAG ACG GTG ACC TG, 

c) cloning the recovered fragment in phase in the 
immuno PBS vector after digestion of the vector 
with Xho and Spe restriction enzymes, 

d) transforming host cells, especially E. coli by 
transfection with the recombinant immuno PBS 
vector of step c, 

e) recovering the expression product of the VHH 
coding sequence, for instance by using antibodies 
raised against the dromedary VHH domain. 

15. Hetero-specific immunoglobulins according to any 
one of claims 1 to 13 obtainable by a process 
comprising the steps of: 

obtaining a first DNA or cDNA sequence coding for 
a VHH domain or part thereof having a determined 
specificity against a given antigen and comprised 
between Xho and Spe sites, 

obtaining a second DNA or cDNA sequence coding 
for a VHH domain or part thereof, having a 
determined specificity different from the 
specificity of the first DNA or cDNA sequence and 
comprised between the Spe and EcoRI sites, 

digesting an immuno PBS vector with EcoRI and 
XhoI restriction enzymes, 

ligating the obtained DNA or cDNA sequences 
coding for VHH domains, so that the DNA or cDNA 
sequences are serially cloned in the vector, 

transforming a host cell, especially E. coli cell 
by transfection, and recovering the obtained 
immunoglobulins. 

16. Immunoglobulin according to any one of claims 1 
to 13, obtainable by a process comprising the steps 
of: 

obtaining a DNA or cDNA sequence coding for a VHH 
domain or part thereof, having a determined 
specific antigen binding site, 

amplifying the obtained DNA or cDNA, using a 5' 
primer containing an initiation codon and a 
HindIII site, and a 3' primer containing a 
termination codon having a XhoI site, 

recombining the amplified DNA or cDNA into the 
HindIII (position 2650) and XhoI (position 4067) 
sites of a plasmid pMM984, 

transfecting permissive cells especially NB-E 
cells with the recombinant plasmid, 

controlling the expression, for instance by an 
ELISA assay with antibodies directed against a 
region of a VHH domain and recovering the 
obtained products. 

17. Immunoglobulins according to claim 16, obtainable 
by a process comprising the further cloning of a 
second DNA or cDNA sequence having another determined 
antigen binding site, in the pMM984 plasmid. 

18. Immunoglobulin according to any one of claims 11 
to 17, characterized in that it is obtainable by a 
process wherein the vector is YEp52 and the 
transformed recombinant cell is a yeast, especially 
S. cerevisiae. 

19. Immunoglobulin according to any one of claims 11 
to 17, characterized in that it is obtainable by a 
process wherein the vector is a vector appropriate for 
expression in plant cells, for example pMon530, and 
the transformed recombinant cells are plant cells. 

20. Immunoglobulin according to any one of claims 16 
to 17, characterized in that it has a catalytic 
activity, especially in that it is directed against an 
antigen mimicking an activated state of a given 
substrate, these immunoglobulins having for instance 
been modified at the level of their catalytic site by 
random or directed mutagenesis. 

21. Nucleotide sequence, characterized in that it 
codes for all or part of an immunoglobulin according 
to any one of claims 1 to 20, which immunoglobulins 
comprise a peptide sequence selected from the 
following: 

VTVSSGTNEVCKCPKCPAPELPGGPSVFVFP, 

VTVSSEPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCPAPELLGGPSVFIFP 

GTNEVCKCPKCP 

APELPGGPSVFVFP 

EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP 
APELLGGPSVFIFP 
APELLGGPTVFIFPPKPKDVLSITLTP 
APELPGGPSVFVFPTKPKDVLSISGRP 
APELPGGPSVFVFPPKPKDVLSISGRP 
APELLGGPSVFIFPPKPKDVLSISGRP 
GQTREPQVYTLA 
GQTREPQVYTLAPXRLEL 
GQPREPQVYTLPPSRDEL 
GQPREPQVYTLPPSREEM 
GQPREPQVYTLPPSQEEM 
GGSVQTGGSLRLS 
GGSVQTGGSLRLS 
GGSEQGGGSLRLS 
GGSVQPGGSLTLS 
GGSVQAGGSLRLS 
GGSVQAGGSLRLS 
GGSVQAGGSLRLS 
WGQGTQVTVSS 
WGQGTLVTVSS 
WGQGAQVTVSS 
WGQGTQVTASS 
RGQGTQVTVSL 
and/or, 



ALQPGGYCGYGX C L 

VSLMDRISQH GC 

VPAHLGPGAILDLKKY------KY 

FCYSTAGDGGSGE -MY 

ELSGGSCELPLLF- D Y 

DWKYWTCGAQTGGYF -GQ 
RLTEMGACDARWATLATRTFAYNY 
QKKDRTRWAEPREW--------NN 

GSRFSSPVGSTSRLES-SDY--NY 

ADPSIYYSILXIEY KY 

DSPCYMPTMPAPPIRDSFGW DD 

TSSFYWYCTTAPY NV 

TEIEWYGCNLRTTF T R 

NQLAGGWYLDPNYWLSVGAY AI 

RLTEMGACDARWATLATRTFAYNY 
DGWTRKEGGIGLPWSVQCEDGYNY 

DSYPCHLL DV 

VEYPIADMCS RY 

22. Nucleotide sequence characterized in that it 
codes for an immunoglobulin according to any one of 
claims 1 to 20, and in that it comprises a sequence 
selected from those represented in figure 7. 
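
The relation between the nucleotide sequences of claim 22 and the peptide sequences of claim 21 can be checked with a short translation sketch. The fragment below is the stretch TGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA found near the 3' end of several of the aligned clones; the reading frame is an assumption chosen for this illustration, and the standard genetic code is used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; unknown codons (e.g. containing N) become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Fragment taken from the aligned clone sequences (frame assumed).
fw4 = "TGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA"
print(translate(fw4))  # WGQGTQVTVSS
```

The translated peptide corresponds to the sequence WGQGTQVTVSS listed in claim 21.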

23. Process for the preparation of a monoclonal 
antibody according to any one of claims 1 to 20, 
directed against a determined antigen, the antigen 
binding site of the antibody consisting of heavy 
polypeptide chains and which antibody is further 
devoid of light polypeptide chains, which process 
comprises: 

immortalizing lymphocytes, obtained for example 
from the peripheral blood of Camelids previously 
immunized with a determined antigen, with an 
immortal cell and preferably with myeloma cells, 
in order to form a hybridoma, 

culturing the immortalized cells formed and 
recovering the cells producing the antibodies 
having the desired specificity. 

24. Process for the preparation of antibodies 
directed against determined antigens, comprising the 
steps of: 

cloning into vectors, especially into phages and 
more particularly filamentous bacteriophages, DNA 
or cDNA sequence obtained from lymphocytes of 
Camelids previously immunized with determined 
antigens, capable of producing an immunoglobulin 
according to any one of claims 1 to 20, 

transforming prokaryotic cells with the above 
vectors in conditions allowing the production of the 
antibodies, 

selecting the appropriate antibody by subjecting 
the transformed cells to antigen-affinity selection, 

recovering the antibodies having the desired 
specificity. 

25. Process according to claim 24, wherein the 
cloning vector is a plasmid or a eukaryotic virus and 
the transformed cell is a eukaryotic cell, especially 
a yeast cell, mammalian cell, plant cell or protozoan 
cell. 

26. Process according to claim 24, wherein the cloning 
vector is a plasmid capable of expressing the 
immunoglobulin in the bacterial membrane. 

27. Process according to claim 24, wherein the cloning 
vector is a plasmid capable of expressing the 
immunoglobulin as a secreted protein. 

28. Immunoglobulin according to any one of claims 1 to 
20, characterized in that it is directed against an 
antigen such as one of a bacterium, a virus, a 
parasite, or against a protein, hapten, carbohydrate 
or nucleic acid. 

29. Immunoglobulin according to any one of claims 1 to 
20, characterized in that it is directed against an 
immunoglobulin idiotype. 

30. Immunoglobulin according to any one of claims 1 to 
20, characterized in that it is directed against a 
cellular receptor or membrane protein. 

31. Immunoglobulin according to any one of claims 1 to 
20, characterized in that it has a catalytic activity. 

32. Immunoglobulin according to any one of claims 1 to 
20, or a fragment according to claim 12, characterized 
in that it is conjugated with a toxin. 

33. Use of a fragment comprising a repeated sequence 
Pro-X, X being any amino acid and preferably Gln, Lys 
or Glu, the sequence advantageously containing at 
least 3 repeats of Pro-X, and especially of a fragment 
composed of a 12-fold repeat of the sequence Pro-X, 
for coupling protein domains or a protein and a 
ligand. 
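
The Pro-X repeat of claim 33 can be located in a peptide with a short sketch. The hinge sequence used here is the long hinge listed in claim 21; the function name `longest_pro_x_run` and the restriction of X to the preferred residues Gln, Lys and Glu (one-letter codes Q, K and E) are choices made for this illustration only.

```python
import re

# Long hinge sequence as listed in claim 21.
HINGE = "EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP"


def longest_pro_x_run(peptide: str, x_residues: str = "QKE") -> int:
    """Return the number of consecutive Pro-X units in the longest Pro-X run.

    X is restricted to the residues in x_residues (default: Gln, Lys, Glu).
    """
    pattern = re.compile(f"(?:P[{x_residues}])+")
    runs = [len(m.group(0)) // 2 for m in pattern.finditer(peptide.upper())]
    return max(runs, default=0)


print(longest_pro_x_run(HINGE))  # 12
```

With the preferred residues for X, the longest uninterrupted run in this hinge sequence is a 12-fold repeat of Pro-X.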

34. Use of the hinge region or of a fragment of the 
hinge region of an immunoglobulin according to any one 
of claims 1 to 20, for coupling protein domains or a 
protein and a ligand. 

35. Immunoglobulin according to any one of claims 1 to 
20, characterized in that it is a heterospecific 
antibody. 

36. Recombinant vector characterized in that it 
comprises a nucleotide sequence according to claim 21 
or claim 22, and in that it is a plasmid, a phage, 
especially a bacteriophage, a virus, a YAC or a cosmid. 

37. Recombinant cell or organism characterized in 
that it is modified by a vector according to claim 36. 

38. A cDNA library composed of nucleotide sequences 
coding for a heavy-chain immunoglobulin according to 
any one of claims 1 to 20, such as obtained by 
performing the following steps: 

a) treating a sample containing lymphoid cells, 
especially peripheral lymphocytes, spleen cells, lymph 
nodes or another lymphoid tissue from a healthy 
animal, especially selected among the Camelids, in 
order to separate the B-lymphocytes, 

b) separating polyadenylated RNA from the other 
nucleic acids and components of the cells, 

c) reacting the obtained RNA with a reverse 
transcriptase in order to obtain the corresponding 
cDNA, 

d) contacting the obtained cDNA with 5' primers 
corresponding to mouse V H domain of four-chain 
immunoglobulins, which primer contains a determined 
restriction site, for example an XhoI site, and with 3' 
primers corresponding to the N-terminal part of a C H 2 
domain, 

e) amplifying the DNA, 

f) cloning the amplified sequence in a vector, 
especially in a Bluescript vector, 

g) recovering the clones hybridizing with a probe 
corresponding to the sequence coding for a constant 
domain from an isolated heavy-chain immunoglobulin. 

39. A modified 4-chain immunoglobulin or a fragment 
thereof, the V H regions of which have been partially 
replaced by specific sequences or amino acids of heavy 
chain immunoglobulins according to any one of claims 1 
to 20. 

40. A modified 4-chain immunoglobulin or a fragment 
thereof, according to claim 39, wherein the leucine, 
proline or glutamine in position 45 of the V H regions 
has been replaced by other amino acids and preferably 
by arginine, glutamic acid or cysteine. 

41. A modified 4-chain immunoglobulin or a fragment 
thereof, in which the CDR loops of the region are 
linked to other parts of the V region by the 
introduction of paired cysteines, in particular in 
which the CDR 3 loop is linked to the FW 2 or CVR, and 
more especially where the cysteine of the CDR 3 of the 
V H is linked to a cysteine in position 31 or 33 of FW 2 
or in position 45 of CDR 2 . 
AGGTGA- 



-AGGTGA- 



- TCTGGGGG AGG 
- TCTGGGGG AGG 
- TCTGGAGGAGG 
- TCTGGGGGAGG 
-TCTGGGGGAGG 
-TCTGGAGGAGG 
-TCTGGAGGAGG 
-TCTGGGGGAGG 
-TCTGGGGGAGG 
-TCAGGGGGAGG 
-TCTGGGGGAGG 
-TCTGGAGGAGG 
-TCTGGGGGAGG 
-TCTGGGGGAGG 
-TCTGGGGGAGG 
-TCAGGGGGAGG 

CTCGAGTCAGGTGTCCGGTCTGATGTGCAGCTGGTGGCGTCTGGGGGAGG 



C- 
C- 
C- 
C- 
C- 
C- 

c- 
c- 
c- 
c- 
c- 
c- 
c- 
c- 
c- 
c- 



-AGGTGA- 



TCGAG-- 

TCGAG - 

- AACTGCTCGAG - - 

TCGAG— 

-AACTGCTCGAG— 

TCGAG— 

TCGAG— 

TCGAG— 

TCGAG-- 

TCGAG— 

TCGAG— 

TCGAG— 

TCGAG— 

-AACTGCTCGAG— 

TCGAG-- 

TCGAG— 



DRO 1 0 0 6 ATCGGTGCAGGCTGGAGGGTCTCTGAGACTCTC— GTGCG -CAGCCTCTG 

DR2 7006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTGCATCTTCTTCTA 

DRO 3 0 0 6 CTCGGTGCAG ACTGGAGGATCTCTGAGACTCTCCTGTGCAGT- -C -TCTG 

DR1 1006 GTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTAATGT- -C -TCTG 

DR2 4006 GTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTAATGT- -C -TCTG 

DR16006 CTCGGCGCAGGCTGGAGGATCTCTGAGACTCTCCTGTGCAGC — CCACGG 

DR1 9006 CTCGGTTCAGGCTGGAGGGTCCCTTAGACTCTCCTGTGCAGC- -C -TCTG 

DRO 7 0 06 CTCGGTGCAGGGTGGAGGGTCTCTGAGACTCTCCTGTGCAA TCTCTG 

DR 1 6 0 0 6 CTCGGTGCAGGCTGG AGGGTCTCTGAGACTCTCCTGTACAG GCTCTG 

DR2 0006 CTCGGTAC AGGTTGG AGGGTCTCTGAGACTCTCCTGTGTAG CCTCTA 

DR2 5 0 0 6 CTCGGTACAAACTGGAGGGTCTCTGAGACTCTCTTGCG AAATCTCTG 

DR2 0006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTG TAGCCTCTG 

DR2 1006 CTCGGTGCAGGTTGGAGGGTCTCTGAAACTCTCCTGTAAAAT CTCTG 

DRO 9006 CTCGGTGCAGGCTGGGGGGTCTCTGACACTCTCTTGTG TATACAC - - 

DR1 7006 CTCGGTCCAACCTGG AGGATCTCTGACACTCTCCTGTACAGTT TCTG 

DR13006 CTCGGTGGAGGCTGGAGGGTCTCTGAGACTCTCCTGTACAG CCTCTG 

DRO 2 0 0 6 CTCGGTGCAGGCTGGAGGCTCTCTGAGACTCTCCTGTACAG CCTCTG 
GA - - TAC AGTAATT 
AA - - TATATGCCTT - 
GA- - TTCTCCTTTA - 
GC - - TCTCCCAGTA - 
GC --TCTCCCAGTA - 
GA--TTCCGC-TCA- 
AC - - TAC ACCATC A - 
G A - - TAC ACGTACG - 
GA--TTCCCCTATA- 
CT- -CACACCGACA - 
G A - - TTG ACTTTTG - 
GA- - TTC AATTTCG - 



- - GTCCCC.TC ACTTG 
- -GCACCTACGACAT- 
- -GTACCAGTTGTAT 
- -GTACTTATTGCCT 
- - GTACTTATTGCCT 
- - ATGGTTACTACAT 
- - CTG ATTATTGCAT 
- -GTAGCTTCTGTAT 
- -GTACCTTCTGTCT 
- - GTAGC ACCTGTAT 

- ATG ATTCTGACGT 

- AAACTTCTCGTAT 



•GAGCTGGTATCGCCAGTTT 
-GACCTGGTACCGCCAGGCT 
•GGCCTGGTTCCGCCAGGCT 
•GGGCTGGTTCCGCCAGGCT 
•GGGCTGGTTCCGCCAGGCT 
•CGCCTGGTTCCGTCAGGCT 
•GGCCTGGTTCCGCCAGGCT 
-GGGCTGGTTCCGCGAGGGT 
-GGGGTGGTTCCGCCAGGCT 
-AGGCTGGTTCCGCCAGGCT 
-GGGGTGGTACCGCCAGGCT 
-GGCGTGGTACCGCCAGACT 



GAGGTACCCCAGATCGTGTTCCTAAATCTTTGGCCTGGTTCCGCCAGGCT 
DR09006 CAACGATACTGGGACCA TGGGATGGTTTCGCCAGGCT 

DR1 7006 - -GGGCCACCTACA GTGACTACAGTATTGGA-TGGATCCGCCAGGCT 

DR13006 G ATACGTAT-CCT CTATGGCCTGGTTCCGCCAGGTT 

DR02006 GAGA CAGTTTCAGTAGATT--TGCCATGTCTTGGTTCCGCCAGGCT 



DR 0 1 0 0 6 CC AGGAACGGAGCGCG AGTTCGTCTCCAGTATGG ATCCGGATGGAAATAC 

DR2 7006 CCAGGCAAGGAGCGCGAATTTGTCTCAAGTATAAATATTGATGGTAAGAC 

DR 0 3 0 0 6 TCAGGAAAGCAGCGTGAGGGGGTCGCAGCCATTAATAGTGGCGGTGGTAG 

DR1 1006 CCAGGGAGGGAGCGTGAGGGGGTCACAGCGATTAA CACTGATGG 

DR2 4006 CCAGGGAAGGAGCGTGAGGGGGTCACAGCGATTAA CACTGATGG 

DR1 6006 CCTGGGAAGGGGCGTGAGGGGGTCGCAACAATTAATGGTGGTCG 

DR1 9 0 0 6 CCAGGGAAGGAGCGTGAATTGGTCGCAGCGATTCAAGTTGTCCGTAGTGA 

DR07 00 6 CCAGGCAAGGAACGTGAGGGGATCGCAACTATTCTTAATGGTGGTACTAA 

DR1 6 0 0 6 CCAGGGAAGGAGCGTGAGGGGGTCGCGGGTATTAATAGTGCAGGAGGTAA 

DR2 0006 CC AGGGAAGGAGCGCGAGGGGGTCGCAAGTATATATTTTGGTGATGGTGG 

DR2 5006 CCAGGGGATGAGTGCAAATTGGTCTCAGGTATTCTGAGTGATGGTACT-C 

DR20006 CCAGGAAATGTGTGTGAGTTGGTCTCAAGTATTTACAGTGATGG 

DR2 1 0 0 6 CCAGAGAAGGAGCGCGAGGGGATCGCAGTTCTTTCGACTAAGGATGGTAA 

DRO 9 0 0 6 CCAGGGAAAGAGTGCGAAAGGGTCGCGCATATTACGCCTGATGGTATGA- 

DR1 7 0 0 6 CCAGGGAAGGACCGTGAAGTAGTCGCAGCCGCTAATACTGGTG 

DR1 3006 CCAGGGCAGGAGCGCGAGGGGGTCGCGTTTGTTCAAACGG 

DR02 0 0 6 CCAGGGAAGGAGTGCGAATTGGTCTCAAGCATTCAAAGTAATGGAAGGAC 



DRO 1006 CAAGTACA CATACTCCGTGAAGGGCCGCTTCACC 

DR27 006 AACATACG CAGACTCCGTGAAGGGCCGATTCACC 

DR03 00 6 GACATACTA-CAACACATATGTCGCCGAGTCCGTGAAGGGCCGATTCGCC 

DR1 1006 CAGTATCAT- ATACGCA GCCGACTCCGTGAAGGGCCGATTCACC 

DR2 4006 CAGTGTCAT- ATACGCA GCCGACTCCGTGAAGGGCCGATTCACC 

DR1 6006 CGA - CGTC AC ATACTACGCCG ACTCCGTGACGGGCCGATTTACC 

DR1 9006 TACT- -CGC -C-TCACAGACTACGCCGACTCCGTGAAGGGACGATTCACC 

DR07006 CACATACTATGCCGACTCGGTGAAGGGCCGATTCACC 

DR16006 TACTTACTATGCCGACGCCGTGAAGGGCCGATTCACC 

DR2 0006 TACGAATTATCGCGACTCCGTGAAGGGCCGATTCACC 

DR2 500 6 CATATAC AAAGAGTGGAGACTATGCTGAGTCTGTGAGGGGCCGGGTTACC 

DR2 00 0 6 CA-AAACATACTACGTCGACC- -GCA- TGAAGGGCCGATTCACC 

DR2 1006 GA CATTCTATGCCGACTCCGTGAAGGGCCGATTCACC 

DRO 9006 CCTTCATTGATGAACCCGTGAAGGGGCGATTCACG 

DR17006 CGACTAGTAAATTCTACGTCGACTTTGTGAAGGGCCGATTCACC 

DR1 3 0 0 6 - -CTGACAAT- AGTGCATTATATGGCGACTCCGTGAAGGGCCGATTCACC 

DR 0 2 0 0 6 AACTGA GGCCG ATTCCGTGCAAGGCCG ATTC ACC 



DRO 1 0 0 6 ATGTCCCGAGGCAGCACCGAGTACACAGTATTTCTGCAAATGGACAATCT 

DR2 7 006 ATCTCCC AAGAC AGCGCCAAGAACACGGTGTATCTGC AGATGAACAGCCT 

DRO 3 0 0 6 ATCTCCCAAGAC AACGCCAAGACCACGGTATATCTTG ATATGAACAACCT 

DR1 1 00 6 ATCTCCC AAG AC ACCGCCAAGGAAACGGTACATCTCC AG ATGAACAACCT 

DR2 4006 ATCTCCC AAG AC ACCGCC AAGAAAACGGTATATCTCC AG ATGAACAACCT 

DR1 6 0 0 6 ATCTCCCGAG ACAGCCCCAAGAATACGGTGTATCTGCAGATGAACAGCCT 

DR1 9 0 0 6 ATCTCCCAAGGCAACACCAAGAACACAGTGAATCTGCAAATGAACAGCCT 

DR07 0 0 6 ATCTCCCAAGACAGCACGTTGAAGACGATGTATCTGCTAATGAACAACCT 

DR1 6 0 0 6 ATCTCCC AAGGGAATGCCAAGAATACGGTGTTTCTGC AAATGGATAACTT 

DR2 0 0 0 6 ATCTCCC AACTCAACGCCCAGAACACAGTGTATCTGCAAATGAACAGCCT 

DR2 5006 ATCTCCAGAGACAACGCCAAGAACATG ATATACCTTCAAATGAACGACCT 

DR2 0 0 0 6 ATTTCTAGAG AG AATGCCAAGAATACATTGTATCTACAACTG AGCGGCCT 

DR2 1 0 0 6 ATCTTCTTAGATAATGACAAGACCACTTTCTCCTTACAACTTGATCGACT 

DR09006 ATCTCCCGAG ACAACGCCCAGAAAACGTTGTCTTTGCGAATGAATAGTCT 
DR17006 ATTTCCCAAGACAACGCCAAGAATACGGTATATCTGCAAATGAGCTTCCT 
DR13006 ATCTCCCACGACAACGCC AAGAACACGCTGTATCTGCAAATGCGCAACCT 
DR02 00 6 ATCTCCCGAGACAATTCCAGGAACACAGTGTATCTGCAAATGAACAGCCT 



DR01006 GAAACCTGAGGAC ACGGCG ATGTATTACTGTAAAAC -A GCCCTAC - - 

DR27 0 0 6 GAAACCTGAGGACACGGCGATGTATTACTGTAAAAT-A GA- -TTC- - 

DRO 3 0 0 6 AACCCCTGAAGAC ACGGCTACGTATTACTGTGCGGCGG TCCCAGCCC 

DR1 1006 GCAACCTGAGGATACGGCCACCTATTACTGCGCGGCAA GACTGACGG 

DR2 4 006 GCAACCTGAGGATACGGCCACCTATTACTGCGCGGCAA GACTGACGG 

DR1 6 0 0 6 GAAACCTGAGGAC ACGGCCATCTACTTCTGTGCAGCAG G CTC 

DR1 9006 GACACCTGAGGACACGGCCATCTACAGTTGTGCGGCAA C CAG 

DR07 006 GAAACCTGAAGAC ACGGGCACCTATTACTGTGCTG -CA GAACTAAGT 

DR16006 GAAACCTGAGGAC ACGGCCATCTATTACTGCGCGG - CG GATAGTCCA 

DR2 0006 GAAACCTGAGGACAGCGCCATGTACTACTGTGCAATCA CTGAAATTG 

DR2 5006 GAAACCTGAGGACACGGCCATGTATTACTGCGCGGTAGATGGTTGGACCC 

DR2 00 0 6 CAAACCTGAGGACACGGCCATGTATTACTGTGCG CC 

DR21006 GAACCCGGAGGACACTGCCGACTACTACTGCGCTGCAAATCAATTAGC - - 

DRO 9 0 0 6 GAGGCCTGAGGACACGGCCGTGTATTACTGTGCGGCAGATTG 

DR17 0 0 6 GAAACCTGAGGACACGGCCATCTATTACTGTGCGGCAG CGGACCC 

DR13006 GCAACCTGACGACACTGGCGTGTACTACTGTGCGGCC CAA 

DRO 2 0 0 6 GAAACCCGAGGACACGGCCGTGTATTACTGTGGGGCAGT 



DR01006 A- AC — CTGGGGGTTATTGTGGGTA - 

DR27006 GTAC — CCGTGCCATCTCCTTGATG - 

DR03006 • ACTTGGGACCT - GGCG-CCATT CTTGATTTG 

DR1 1006 AGATGGGGGCTTGTGATGCGAGATGGGCGACCTTAGC- -GACAAGGAC-G 

DR24006 AGATGGGGGCTTGTGATGCGAGATGGGCGACCTTAGC — GACAAGGAC-G 

DR16006 GCGTTTTT-CTAGTCCTGTTGGG AGCACTTC -TAGAC TCGAAAGTAG 

DR19006 TAGTTTTTAC TGGTACT GCAC C ACG G 

DRO 70 0 6 GGTGGTAGTTGTGAATTGC CTTTGC TATTTGACTA 

DR1 6006 TGTTACATGCCGACTATGC CCGCTCCCCCGATACGAGACAGTTTTGG 

DR2 0006 AGTGGTATGGGTGCAATTT AAGG AC TAC TTTTACT C G 

DR2 5006 GG AAGG AAG- -GGGGAATCGGGTTAC CCTGGTCGGTCCAATGTGAA 

DR2 000 6 GGTTGAA TATC C TATTGC AG AC - - ATGTGTT 

DR2 1006 TGGTGGCTGGTATT TGGACCCGAATTACTGG - CTCTCTGTG 

DR09 0 06 GAAATACTGGA CTTGTGGTGC - -CCAGA-CTGG AG 

DR 1 7 0 0 6 AAGTATATATTATAGTATC CTCCNNAT 

DR 1 3 0 0 6 AAGAAGG ATCGTA CTAGATGGGC CG AGCCT 

DR02006 CTCCCTAA- -TGGACCGAATTTC 



DR01006 -- TGGGTANTGCCTCTGGGGCCAGGGGACCC AGGTCACCGTCTCCTCACT 

DR2 7006 --T CTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DRO 3 0 0 6 AAAAAGTATAAGTACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR1 1 0 0 6 TTTGCGTATAACTACTGGGGCCGGGGGACCCAGGTCACCGTCTCCTCACT 

DR2 4 006 TTTGCGTATAACTACTGGGGCCGGGGGACCCAGGTCACCGTCTCCTCACT 

DR1 600 6 CGA-CT-ATAACTATTGGGGCCAGGGGATCCAGGTCACCGTCACCTCACT 

DR1 9006 CGC -CTTATAACGTCTGGGGTCAGGGGACCC AGGTCACCGTCTCCTCACT 

DRO 7 0 0 6 CTGGG GCCAGGGCACCC AGGTCACCGTCTCCTCACT 

DR1 6006 CTGGGATGATTTT GGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR20006 CTGGG GCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR2 5006 G ATGGTTATAACTATTGGGGCCAGGGGACCC AGGTC ACCGTCTCCTCAC - 

DR2 0006 CGAGAT ACG GCGACCCGGGGACCCAGGTCACCGTCTCCTCAC- 

DR2 1 0 0 6 GGTGCATATGCC ATCTGGGGCC AGGGGACCC AGGTC ACCGTCTCCTCAC - 

DRO 9 0 0 6 G ATACTTCGG AC AG - TGGGGTCAGGGGGCCC AGGTCACCGTCTCCTCACT 

DR1 7 0 06 - -TGAGTAT AAGTACTGGGGCC AGGGGACCC AGGTC ACCGTCTCCTCA- - 
DR13006 CGAGAATGGAACAACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA- 
DR02 0 0 6 CCAACATGGG - -TGCCGGGGCCAGGGAACCCAGGTCACCGTCTCCT- - - 



DR 01006 AG TTACCCGTACGACGTTCCGG ACTACGGTTCTTAATAG AATTC 

DR27 0 06 AG TTACCCGTACGAGCTTCCGGACTACGGTTCTTAATAGAATTC 

DRO 3 0 0 6 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR1 1006 AG TTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR2 4 0 0 6 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR1 6006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR19006 AG TT ACCCG TAC G ACGTTCCGG ACTAC GG TTCTTAAT AG AATTC 

DR07006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR16006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR2 0006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR25006 TAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR2 0006 TAGTTACCCGTACGACGAACCGGACTACGGTTCTTAATAGAATTC 

DR21006 TAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR09006 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR17006 

DR13006 

DR02006 TA 